Skip to main content

Automatically scale Amazon SageMaker endpoints for inference

Many customers have ML applications with intermittent usage patterns. As a result, customers end up provisioning for peak capacity up front, which results in idle capacity. In this session, learn how to use Amazon SageMaker to reduce costs for intermittent workloads and scale automatically based on your needs.