Deploy ML models for inference at high performance and low cost
High-performance, cost-effective model deployment is critical to maximizing the return on your machine learning (ML) investments. Amazon SageMaker provides a broad and deep set of fully managed deployment features that help you achieve optimal inference performance and cost while reducing the operational burden of deploying and managing models in production. In this session, learn how to use SageMaker inference capabilities to quickly deploy ML models in production at scale. Discover SageMaker deployment options, including infrastructure choices; real-time, serverless, asynchronous, and batch inference; single-model, multi-model, and multi-container endpoints; auto scaling; SageMaker Inference Recommender; model monitoring; and SageMaker MLOps integration. We also cover how to validate the performance of new ML models against production models to prevent costly outages.
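To make the deployment options above concrete, here is a minimal sketch of how real-time and serverless endpoints differ at the API level. It builds the `ProductionVariants` payloads that `boto3`'s `create_endpoint_config` call accepts; the model and endpoint names are placeholders, and your instance type, memory size, and concurrency settings would depend on your workload (tools like SageMaker Inference Recommender can help choose them).

```python
# Sketch: ProductionVariants payloads for two SageMaker deployment options.
# "my-model", "my-endpoint-config", and "my-endpoint" are placeholder names.

def realtime_variant(model_name, instance_type="ml.m5.xlarge", count=1):
    # Real-time inference: a dedicated, always-on instance fleet behind
    # the endpoint; you pick the instance type and initial count.
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "InstanceType": instance_type,
        "InitialInstanceCount": count,
    }

def serverless_variant(model_name, memory_mb=2048, max_concurrency=5):
    # Serverless inference: no instances to manage; you specify memory
    # and maximum concurrency, and capacity scales with traffic.
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "ServerlessConfig": {
            "MemorySizeInMB": memory_mb,
            "MaxConcurrency": max_concurrency,
        },
    }

# With boto3, either payload is passed to the same pair of calls:
#
#   import boto3
#   sm = boto3.client("sagemaker")
#   sm.create_endpoint_config(
#       EndpointConfigName="my-endpoint-config",
#       ProductionVariants=[realtime_variant("my-model")],
#   )
#   sm.create_endpoint(
#       EndpointName="my-endpoint",
#       EndpointConfigName="my-endpoint-config",
#   )
```

The point of the sketch is that switching between deployment options is largely a change to the variant configuration, not to your model artifact or container.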