Accelerating generative AI deployment with microservices

An exclusive webcast with AWS and NVIDIA on the transformative potential of
microservices for the deployment of generative AI models

How microservices can drive generative AI innovation

In this exclusive webcast, we examine how portable microservices can simplify the deployment of generative AI models. We explore how startups and large organizations are using this technology to streamline deployment, improve customer service, and drive innovation in areas such as chatbots, document analysis, and video generation.

Our discussion focuses on overcoming key challenges such as deployment complexity, security, and cost management. We also cover how microservices can help executives realize business value from generative AI while maintaining control over data and intellectual property.

Learn from the experts

Approaches for moving beyond proof of concept and into deployment with a microservices architecture

Ways to develop a clear vision of generative AI solutions and the specific business problems they solve

Advantages of a microservices architecture for speed, agility, and scaling

Featured speakers

Aman Shanbhag
Associate specialist solutions architect,
ML frameworks
Amazon Web Services

Bethann Noble
Product marketing manager,
enterprise software products
NVIDIA

Aman Shanbhag is an associate specialist solutions architect on the ML Frameworks team at Amazon Web Services, where he helps customers and partners deploy ML training and inference solutions at scale. Before joining AWS, Aman graduated from Rice University with degrees in computer science, mathematics, and entrepreneurship.

Bethann Noble is a product marketing manager for enterprise software products at NVIDIA, including the NVIDIA AI Enterprise software platform with NVIDIA NIM. Previously, she held senior positions in marketing and product marketing at AI copilot startup Continual, AI-powered bot protection platform HUMAN Security, Cloudera, and IBM. Bethann has a bachelor’s degree in mathematics from the University of Texas at Austin.

About the hosts

MIT Technology Review

Founded at the Massachusetts Institute of Technology in 1899, MIT Technology Review is a world-renowned, independent media company whose insight, analysis, reviews, interviews, and live events explain the newest technologies and their commercial, social, and political impacts. MIT Technology Review derives authority from its relationship to the world's foremost technology institution and from its editors' deep technical knowledge, capacity to see technologies in their broadest context, and unequaled access to leading innovators and researchers. Our in-depth reporting reveals what’s going on now to prepare you for what’s coming next.

Learn more at technologyreview.com.

Amazon Web Services and NVIDIA

AWS and NVIDIA have collaborated since 2010 to deliver large-scale, cost-effective, and flexible GPU-accelerated solutions for customers. Spanning the cloud to the edge, these innovations extend across infrastructure, software, and services to offer a full-stack offering that shortens time to solution when building and deploying AI into production. With GPU-accelerated solutions available in multiple AWS Regions, customers can access the compute power they need to achieve low latency, high performance, and high reliability.