Arcee AI Blog Posts
16 articles on Small Language Models and practical AI deployment
September 2025
Optimizing Arcee Foundation Models on Intel CPUs
August 2025
Deploy Arcee AFM-4.5B on Arm-based Google Cloud Axion with Llama.cpp (Arm Learning Path)
August 2025
An Amazon SageMaker Container for Hugging Face Inference on AWS Graviton
Happy to share my new GitHub project: “An Amazon SageMaker Container for Hugging Face Inference on AWS Graviton”.
July 2025
Deploy Arcee AFM-4.5B on Arm-based AWS Graviton4 with Llama.cpp (Arm Learning Path)
July 2025
Small And Mighty: Arcee AI Language Models Excel Across Yupp.ai Leaderboards
Originally published at https://www.arcee.ai/blog/arcee-ai-small-language-models-excel-across-yupp-ai-leaderboards
July 2025
Is Running Language Models on CPU Really Viable?
Originally published at https://www.arcee.ai/blog/is-running-language-models-on-cpu-really-viable
June 2025
Announcing the Arcee Foundation Model Family
Originally published at https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family
June 2025
Arcee Conductor Wins LLM Application of the Year at 2025 AI Breakthrough Awards
Originally published at https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family
June 2025
Building an AI Retail Assistant at the Edge with Small Language Models and Intel Xeon CPUs
Originally published at https://www.arcee.ai/blog/building-an-ai-retail-assistant-at-the-edge-with-small-language-models-and-intel-xeon-cpus
June 2025
Breaking Down Model Vocabulary Barriers with Tokenizer Transplantation
Originally published at https://www.arcee.ai/blog/breaking-down-model-vocabulary-barriers-with-tokenizer-transplantation
June 2025
Releasing Five New Open-Weights Models
Originally published at https://www.arcee.ai/blog/breaking-down-model-vocabulary-barriers-with-tokenizer-transplantation
June 2025
Arcee Conductor and Zerve: Bringing Model Routing to AI and Data Science Workflows
May 2025
Arcee AI Small Language Models on Together AI and OpenRouter
Originally published at https://www.arcee.ai/blog/arcee-ai-small-language-models-on-together-ai-and-openrouter
May 2025
Running GenAI Inference with AWS Graviton and Arcee AI Models
May 2025
Enriching Inventory Data with Arcee Conductor
April 2025
The Case for Small Language Model Inference on ARM CPUs
Originally published at https://www.arcee.ai/blog/the-case-for-small-language-model-inference-on-arm-cpus