Deploying machine learning models for inference - AWS Virtual Workshop

Maximizing inference performance while reducing cost is critical to delivering great customer experiences through ML. Amazon SageMaker provides a broad, deep set of fully managed deployment features that help you achieve optimal inference performance and cost at scale, without the operational burden. In this episode, learn how to use SageMaker inference capabilities to quickly deploy ML models in production for any use case, including hyper-personalization, generative AI, and large language models (LLMs).
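
As a taste of what this looks like in practice, here is a minimal sketch (not from the webinar itself) of deploying a trained model artifact to a real-time SageMaker endpoint with the SageMaker Python SDK. The S3 path, role ARN, and framework choice are placeholder assumptions for illustration only.

    # Minimal sketch: deploy a model artifact to a real-time endpoint.
    # S3 path, role ARN, and framework below are placeholder assumptions.
    import sagemaker
    from sagemaker.model import Model

    session = sagemaker.Session()
    role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder

    model = Model(
        image_uri=sagemaker.image_uris.retrieve(
            framework="xgboost", region=session.boto_region_name, version="1.7-1"
        ),
        model_data="s3://my-bucket/model/model.tar.gz",  # placeholder artifact
        role=role,
        sagemaker_session=session,
    )

    # Creates the model, endpoint config, and real-time endpoint in one call.
    predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")
    print(predictor.endpoint_name)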

Learning Objectives:
* Objective 1: Learn how to deploy ML models on Amazon SageMaker for inference.
* Objective 2: Discover the SageMaker inference endpoint options that fit your use case.
* Objective 3: Learn how to deploy large language models (LLMs) for inference (see the sketch after this list).
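
For Objective 3, one common pattern covered by SageMaker is hosting an open LLM with the Hugging Face Text Generation Inference (TGI) container via the SageMaker Python SDK. The sketch below assumes that container; the model ID, instance type, and role are placeholder assumptions.

    # Minimal sketch: host an open LLM on a GPU endpoint with the Hugging Face
    # TGI container. Model ID, role, and instance type are placeholders.
    from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

    role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder

    llm_model = HuggingFaceModel(
        role=role,
        image_uri=get_huggingface_llm_image_uri("huggingface"),  # TGI container
        env={
            "HF_MODEL_ID": "tiiuae/falcon-7b-instruct",  # placeholder open LLM
            "SM_NUM_GPUS": "1",  # number of GPUs to shard the model across
        },
    )

    # Provision a GPU-backed real-time endpoint and run a test generation.
    llm = llm_model.deploy(initial_instance_count=1, instance_type="ml.g5.2xlarge")
    print(llm.predict({"inputs": "What is Amazon SageMaker inference?"}))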

To learn more about the services featured in this talk, please visit: https://aws.amazon.com/sagemaker/deploy/
To download a copy of the slide deck from this webinar, visit: https://pages.awscloud.com/Deploying-machine-learning-models-for-inference_2023_VW-0616-MCL_OD

Subscribe to AWS Online Tech Talks on AWS:
https://www.youtube.com/@AWSOnlineTechTalks?sub_confirmation=1

Follow Amazon Web Services:
Official Website: https://aws.amazon.com/what-is-aws
Twitch: https://twitch.tv/aws
Twitter: https://twitter.com/awsdevelopers
Facebook: https://facebook.com/amazonwebservices
Instagram: https://instagram.com/amazonwebservices

☁️ AWS Online Tech Talks cover a wide range of topics and expertise levels through technical deep dives, demos, customer examples, and live Q&A with AWS experts. Builders can choose from bite-sized 15-minute sessions, insightful fireside chats, immersive virtual workshops, and interactive office hours, or watch on-demand tech talks at their own pace. Join us to fuel your learning journey with AWS.

#AWS
Category: AWS Developers
Tags: Amazon SageMaker, Machine Learning, SageMaker Inference