21st-may-banner design

AWS Unveils Graviton4, Trainium2 for Faster, Affordable AI Model Building

AWS claims to have built over 2 million Graviton processors, with approximately 50,000 customers using them

Share

AWS Re:invent Adam VP
Listen to this story

At re:Invent in Las Vegas, Amazon Web Services (AWS) announced two new AI chips –AWS Graviton4 , AWS Trainium2. The new chips aim to provide advancements in price performance and energy efficiency for a wide range of customer workloads, including machine learning training and generative AI applications. 

Graviton4 offers up to 30% better compute performance, 50% more cores, and 75% more memory bandwidth than Graviton3. Trainium2 delivers up to 4x faster training than its first generation, with deployment capability in EC2 UltraClusters of up to 100,000 chips.

(Source: Business Wire)

David Brown, VP of Compute and Networking at AWS said that Graviton4 marks the fourth generation they have delivered in just five years, and is the most powerful and energy-efficient chip ever built. “Silicon underpins every customer workload, making it a critical area of innovation for AWS,” he added. 

AWS chief Adam Selipsky said that it has more than 50K customers for Graviton, and its other cloud providers are still just talking about making them, and are yet to deliver first server processors. At Ignite 2023, Microsoft recently launched Azure Maia 100 AI Accelerator, its first in-house custom AI system on a chip. 

Some of its customers leveraging AWS chips include Anthropic, Databricks, Datadog, Epic, Honeycomb, SAP and others. Naveen Rao, VP of generative AI at Databricks said that AWS Trainium gave them the scale and high performance needed to train our Mosaic MPT models, and at a low cost. 

“AWS Graviton4 instances are the fastest EC2 instances we’ve ever tested, and they are delivering outstanding performance across our most competitive and latency-sensitive workloads,” said Roman Visintine, lead cloud engineer at Epic Games. 

Juergen Mueller, CTO of SAP SE said that as part of the migration process of SAP HANA Cloud to AWS Graviton-based Amazon EC2 instances, we have already seen up to 35% better price performance for analytical workloads.

Graviron4-powered R8g instances are available today in preview, with general availability planned in the coming months. Check out here. Trainium2  is said to be available in Amazon Ec2 Trn2 instances Check it out here. 

Share
Picture of Tasmia Ansari

Tasmia Ansari

Tasmia is a tech journalist at AIM, looking to bring a fresh perspective to emerging technologies and trends in data science, analytics, and artificial intelligence.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe

Subscribe to our Youtube channel and see how AI ecosystem works.

There must be a reason why +150K people have chosen to follow us on Linkedin. 😉

Stay in the know with our Linkedin page. Follow us and never miss an update on AI!