MITB Banner

Modular Announces Partnership with AWS, NVIDIA

ModCon 2023 was Modular's first developer conference.

Share

Modular Announces Partnership with AWS, NVIDIA
Listen to this story

At ModCon 2023, Modular, the company behind the Mojo programming language, announced an exclusive collaboration with Amazon Web Services (AWS). The partnership aims to extend the reach of the MAX Platform to AWS production services worldwide, ushering in advanced AI capabilities for a vast user base.

Check out all the key announcements made at ModCon 2023.

Notably, the AWS Marketplace becomes the exclusive venue for leveraging the MAX Platform on Graviton CPUs, Amazon’s ARM-based processors designed for compute-intensive workloads such as AI programs. The MAX Platform optimally enhances Graviton CPUs, delivering AI model execution with up to 5X higher performance and up to 80% cost savings compared to existing AI infrastructure.

Bratin Saha, AWS VP of machine learning & AI services, emphasised the significance of this collaboration in advancing AI capabilities. “At AWS, we are dedicated to shaping the future of AI by delivering services that reduce costs and accelerate progress for enterprises and startups. The MAX Platform amplifies this objective for millions of AWS customers, facilitating the rapid deployment of the latest GenAI innovations and traditional AI use cases,” remarked Saha.

One unique feature of the MAX Platform is its hardware portability, enabling seamless migration of workloads to Graviton without incurring any migration costs. This portability extends to the utilisation of MAX Serving and Mojo directly in the engine, offering full customizability and workload tuning options.

Furthermore, Modular announced a strategic technology partnership with NVIDIA to integrate the benefits of their accelerated compute platform into MAX, simplifying CPU+GPU development for AI developers. Key features unveiled include MAX Engine extensibility with Mojo, MAX Engine GPU support, the release of Mojo SDK v0.6, and the open-sourcing of Mojo documentation.

MAX GPU offers cutting-edge compatibility for NVIDIA H100, H200, A100, and L40 series GPU accelerators, as well as the recently introduced Grace CPU Superchip. This hardware integration extends across all facets of MAX, encompassing MAX Engine, MAX Serving, and Mojo, delivering unmatched heterogeneous computing capabilities to the field of AI. 

Developers can now leverage a unified toolchain that caters to a spectrum of AI applications, spanning from GenAI to various other AI scenarios. This approach unlocks innovative CPU+GPU programming models, promising unparalleled performance and cost-effectiveness.

“Developers everywhere are helping their companies adopt and implement generative AI applications that are customised with the knowledge and needs of their business,” said Dave Salvator, director of AI and Cloud at NVIDIA.

ModCon 2023, Modular’s inaugural developer conference, featured prominent figures in the AI landscape, including Bratin Saha (VP, AWS), Kari Ann Briski (VP, NVIDIA), Bryan Catanzaro, (VP, NVIDIA), Lex Fridman, (AI researcher, MIT), Shawn “Swyx” Wang (Latent.Space), Damien Sereni (director, Meta), Jeremy Howard (Fast.ai), and Michele Catasta, (VP, Replit).

Excitement surrounds this collaboration with early access to MAX on AWS Marketplace available at modul.ar/max, and further updates anticipated in Q1 2024. The partnership signifies a significant step towards ensuring the widespread availability of MAX, marking a pivotal moment in the evolution of AI capabilities.

Share
Picture of Mohit Pandey

Mohit Pandey

Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words. He also holds a keen interest in photography, filmmaking, and the gaming industry.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.