SageMaker Fridays Season 3, Episode 6 — Cost optimization with Machine Learning

In the first section, we have a chat with our special guest Greg Coquillo, a Technology Risk Manager working for Amazon. We walk through an automation project that he’s currently working on for a B2B customer operating in chemicals. In order to build material safety data sheets, the project automatically extracts image and text data from over 100,000 documents a month, using AI services like Amazon Textract, Amazon Comprehend and Amazon Translate. We’ll discuss the key phases of the project, its benefits, and best practices learned along the way.

In the second section, we dive into a large-scale computer vision workload running on SageMaker, and from image labeling to training to predicting, we’ll pull out all the stops to optimize cost. Along the way, you’ll learn about labeling with SageMaker Ground Truth, right-sizing your training infrastructure, training with Managed Spot Training and Pipe Mode, deploying with Elastic Inference, and more. Get ready to save money!


About the Author

Julien Simon is the Chief Evangelist at Arcee AI , specializing in Small Language Models and enterprise AI solutions. Recognized as the #1 AI Evangelist globally by AI Magazine in 2021, he brings over 30 years of technology leadership experience to his role.

With 650+ speaking engagements worldwide and 350+ technical blog posts, Julien is a leading voice in practical AI implementation, cost-effective AI solutions, and the democratization of artificial intelligence. His expertise spans open-source AI, Small Language Models, enterprise AI strategy, and edge computing optimization.

Previously serving as Principal Evangelist at AWS and Chief Evangelist at Hugging Face, Julien has authored books on Amazon SageMaker and contributed to the open-source AI ecosystem. His mission is to make AI accessible, understandable, and controllable for everyone.