MITB Banner

Meet the New Indic LLM, MahaMarathi 7B

Boasting 7 billion parameters, the model is built on Meta Llama-2+Mistral AI framework.

Share

Joining the league of indic LLMs like Telugu, Malayalam, Tamil and Oriya Llama, is MahaMarathi 7B. Boasting 7 billion parameters, the Marathi LLM is built on Meta Llama-2 and Mistral AI framework.

With computing resources and data provided by Microsoft for Startups funded company CourtEasy.ai, this open-source LLM is domain-adapted, continually pre-trained, and instruction fine-tuned using the Meta Llama-2 and Mistral AI framework.

The brains behind this research include Dr. Aakash Patil, postdoctoral researcher at Stanford University, Mrunmayee Shende, cofounder of CourtEasy.ai, and Niraj Singh, ML engineer at Inbound Health. 

To democratise ML research, the team has released the initial version of the pre-trained base model on Hugging Face, inviting developers, startups, and public and private organisations to innovate by developing fine-tuned models for various use cases. 

Marathi’s unique linguistic characteristics, complexity, and cultural context are addressed by MahaMarathi 7B, making it suitable for handling complex conversations and instructions. The language model is available for free on Hugging Face, promoting broader access and encouraging applications in various fields, including business and e-governance.

Marathi, spoken by over 83 million people predominantly in Maharashtra, is the 13th most spoken language globally and the third most common in India. Acknowledging Maharashtra’s significant economic contribution, with Marathi businesses and consumers contributing over 15% to India’s GDP, the MahaMarathi 7B aims to catalyse innovation in the region. The creators envision the potential impact of this Marathi LLM on diverse sectors such as skill training, education, healthcare, agriculture, environment, urban planning, and traffic management. 

The release of the Marathi LLM is a step towards making AI more accessible and applicable to non-English languages. The team plans to release instruction-tuned and preference-optimised models in the coming months.

Share
Picture of Shritama Saha

Shritama Saha

Shritama (she/her) is a technology journalist at AIM who is passionate to explore the influence of AI on different domains including fashion, healthcare and banks.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.