Automatically scale Amazon SageMaker endpoints for inference

Many customers have ML applications with intermittent usage patterns. As a result, customers end up provisioning for peak capacity up front, which results in idle capacity. In this session, learn how to use Amazon SageMaker to reduce costs for intermittent workloads and scale automatically based on your needs.

Previous Video
Build custom deep learning environments with AWS Deep Learning Containers
Build custom deep learning environments with AWS Deep Learning Containers

In this session, learn how to use Deep Learning Containers to build your custom ML environment and how to i...

Next Video
Integrate ML into your analytics workloads
Integrate ML into your analytics workloads

Learn how with Amazon Redshift ML, you can take advantage of Amazon SageMaker, a fully managed ML service, ...