High-performance and cost-effective techniques, including real-time, asynchronous, and batch, are needed to scale model deployments to maximize your ML investments. In this session, learn the different inference options available in Amazon SageMaker, such as multi-container endpoints, inference pipelines, and multi-model endpoints as well as frameworks such as TensorFlow and PyTorch, Python-based backend servers, and C++/Go-based backend servers. Learn how to pick the best inference option for your ML use case so you can scale to thousands of models across your business.

Questions about
AWS AI/ML Solutions?
Contact support
Other content in this Stream

In this session, explore how AWS services can help you move from idea to production with ML and an end-to-end data strategy.

Discover the innovation stories changing the way we live, work, play, look after the planet, and accelerate space exploration.

Identity theft and the ability for fraudulent users to gain access to digital platforms is a prominent concern.

AWS Contact Center Intelligence (CCI) solutions empower you to elevate the customer experience, reduce agent attrition rate, and improve operational efficiency in the contact center of your choice.

Learn how organizations in the retail and media & entertainment industries can easily apply the same AI capabilities.

Organizations across all industries are still manually processing documents, which is time consuming, prone to error, and costly.

The desire to address this risk is leading to greater adoption of facial recognition to bolster the onboarding or know-your-customer (KYC) efforts of digital platforms.

As part of this session, Public Broadcasting Service (PBS) shares their personalization story and its impact.

With Amazon Kendra, you can build an intelligent search solution, powered by ML, to find accurate answers from the unstructured content in your enterprise.

In this session, learn how automation and AI services from AWS can help your customer service and media teams reclaim up to 95 percent of their time spent doing manual moderation.

Join this session to learn how to make the shift toward more automation and proactive mechanisms with ML-powered insights that can help your developer teams innovate faster.

Amazon SageMaker Canvas is a visual, point-and-click service that makes it easy for business analysts to build ML models and generate accurate predictions without writing code or having ML expertise.

Join this session to learn how to prepare data for ML in minutes using Amazon SageMaker. SageMaker offers tools to simplify data preparation so that you can label, prepare, and understand your data.

State-of-the-art models are rapidly increasing in size and complexity. These models can be difficult to train because of cost, time, and skill sets required to optimize memory and compute.

Amazon SageMaker provides all the tools and libraries you need to build ML models.

MLOps practices help data scientists & IT operations professionals collaborate & manage the production ML workflow, including data preparation & building and training, deploying, & monitoring models

In this session, explore how to choose the proper instance for ML training and inference based on model size, complexity, and performance requirements.

In this session, learn how to use Deep Learning Containers to build your custom ML environment and how to implement model training and inference with Deep Learning Containers in Amazon SageMaker.

In this chalk talk, dive into building computer vision (CV) applications at the edge for predictive maintenance, industrial IoT, and more.

Deep Java Library (DJL) is an open-source, high-level, engine-agnostic Java framework for deep learning.