
Amazon makes MASSIVE announcements around a 51-language dataset 

The MASSIVE dataset and the Massively Multilingual NLU (MMNLU-22) competition and workshop will help researchers scale natural-language-understanding technology to every language on Earth.


Amazon has made three announcements for multilingual voice assistants: a new dataset called MASSIVE, a new competition using MASSIVE, and a workshop, Massively Multilingual NLU 2022.

Imagine if everyone in the world could use voice AI systems such as Alexa in their native tongues. A promising approach to realising this vision is massively multilingual natural-language understanding (MMNLU), a paradigm in which a single ML model can parse and understand input from many typologically diverse languages. Such a model can learn a shared data representation that spans languages and transfer knowledge from languages with abundant training data to those in which training data is scarce.

Amazon made three announcements related to MMNLU:

  1. The release of a new dataset called MASSIVE, composed of one million labelled utterances spanning 51 languages, along with open-source code that provides examples of massively multilingual NLU modelling and allows practitioners to re-create baseline results for intent classification and slot filling.
  2. A new competition using the MASSIVE dataset, called Massively Multilingual NLU 2022 (MMNLU-22).
  3. A workshop, also called Massively Multilingual NLU 2022, that Amazon will co-host at EMNLP 2022 in Abu Dhabi and online.
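
For readers unfamiliar with the two baseline tasks, intent classification assigns one label to the whole utterance, while slot filling tags the spans inside it. The sketch below is illustrative only: the record is hypothetical, and the field names and the "[slot : value]" bracket notation are assumptions made for this example rather than a description of the released files or of Amazon's code.

```python
import re

# Hypothetical labelled utterance. The field names and the
# "[slot : value]" notation are assumptions for illustration.
example = {
    "locale": "en-US",
    "intent": "alarm_set",  # intent classification: one label per utterance
    "annot_utt": "wake me up at [time : nine am] on [date : friday]",
}

# Matches "[slot_name : slot value]" and captures both parts.
SLOT_PATTERN = re.compile(r"\[\s*([^:\]]+?)\s*:\s*([^\]]+?)\s*\]")


def extract_slots(annotated: str) -> list[tuple[str, str]]:
    """Return (slot_name, slot_value) pairs found in an annotated utterance."""
    return [(name, value) for name, value in SLOT_PATTERN.findall(annotated)]


def plain_text(annotated: str) -> str:
    """Strip the slot markup, leaving only the words of the utterance."""
    return SLOT_PATTERN.sub(lambda m: m.group(2), annotated)


slots = extract_slots(example["annot_utt"])  # slot filling targets
text = plain_text(example["annot_utt"])      # what the model would see as input
```

An NLU model would take `text` as input and be trained to predict both the `intent` label and the `slots` pairs.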


Prem Natarajan, VP of Alexa AI Natural Understanding, said, “We are very excited to share this large multilingual dataset with the worldwide language research community. We hope the dataset will help researchers worldwide to drive new advances in multilingual language understanding that expand the availability and reach of conversational-AI technologies”.

 

 


Poornima Nataraj

Poornima Nataraj has worked in the mainstream media as a journalist for 12 years and is always eager to learn anything new and evolving. Witnessing a revolution in the world of analytics, she thinks she is in the right place at the right time.