Arcee AI Blog Posts

16 articles on Small Language Models and practical AI deployment

September 2025

Optimizing Arcee Foundation Models on Intel CPUs

August 2025

Deploy Arcee AFM-4.5B on Arm-based Google Cloud Axion with Llama.cpp (Arm Learning Path)

August 2025

An Amazon SageMaker Container for Hugging Face Inference on AWS Graviton

Happy to share my new GitHub project: “An Amazon SageMaker Container for Hugging Face Inference on AWS Graviton”.

July 2025

Deploy Arcee AFM-4.5B on Arm-based AWS Graviton4 with Llama.cpp (Arm Learning Path)

July 2025

Small And Mighty: Arcee AI Language Models Excel Across Yupp.ai Leaderboards

Originally published at https://www.arcee.ai/blog/arcee-ai-small-language-models-excel-across-yupp-ai-leaderboards

July 2025

Is Running Language Models on CPU Really Viable?

Originally published at https://www.arcee.ai/blog/is-running-language-models-on-cpu-really-viable

June 2025

Announcing the Arcee Foundation Model Family

Originally published at https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family

June 2025

Arcee Conductor Wins LLM Application of the Year at 2025 AI Breakthrough Awards

Originally published at https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family

June 2025

Building an AI Retail Assistant at the Edge with Small Language Models and Intel Xeon CPUs

Originally published at https://www.arcee.ai/blog/building-an-ai-retail-assistant-at-the-edge-with-small-language-models-and-intel-xeon-cpus

June 2025

Breaking Down Model Vocabulary Barriers with Tokenizer Transplantation

Originally published at https://www.arcee.ai/blog/breaking-down-model-vocabulary-barriers-with-tokenizer-transplantation

June 2025

Releasing Five New Open-Weights Models

Originally published at https://www.arcee.ai/blog/breaking-down-model-vocabulary-barriers-with-tokenizer-transplantation

June 2025

Arcee Conductor and Zerve: Bringing Model Routing to AI and Data Science Workflows

May 2025

Arcee AI Small Language Models on Together AI and OpenRouter

Originally published at https://www.arcee.ai/blog/arcee-ai-small-language-models-on-together-ai-and-openrouter

May 2025

Running GenAI Inference with AWS Graviton and Arcee AI Models

May 2025

Enriching Inventory Data with Arcee Conductor

April 2025

The Case for Small Language Model Inference on ARM CPUs

Originally published at https://www.arcee.ai/blog/the-case-for-small-language-model-inference-on-arm-cpus