MITB Banner

CognitiveLab Releases Indic LLM Leaderboard

Along with this, Adithya S Kolavi, the founder of CreativeLab has also unveiled the indic_eval evaluation framework.

Share

CognitiveLab Releases Indic LLM Leaderboard
Listen to this story

An Indic LLM leaderboard is finally here. CognitiveLab has released its Indic LLM Leaderboard for the growing number of Indic Language Models entering the scene without a uniform evaluation framework. 

The Indic LLM Leaderboard offers support for 7 Indic languages including Hindi, Kannada, Tamil, Telugu, Malayalam, Marathi, and Gujarati, providing a comprehensive assessment platform. Hosted on Hugging Face, it supports 4 Indic benchmarks initially, with plans for additional benchmarks in the future. 

Click here to check it out.

Along with this, Adithya S Kolavi, the founder of CognitiveLab has also unveiled the indic_eval evaluation framework which supports Arc Easy, Challenge, Hellaswag, MMLU, BoolQ, and Translation.

The leaderboard also seamlessly integrates with indic_eval, simplifying the process of uploading evaluation scores.

This entire system is deployed within India, ensuring robust security measures.

As an alpha release, the Leaderboard promises ongoing enhancements and tested features in subsequent updates. As of this release, base models ‘meta-llama/Llama-2-7b-hf’ and ‘google/gemma-7b’ have been added into the leaderboard to use as reference.

With a commitment to bolstering its capabilities, CognitiveLab aims to establish the Indic LLM Leaderboard as a pivotal tool for evaluating and advancing Indic Language Models.

The leaderboard operates by executing indic_eval on the chosen model, then transmitting the outcomes to a server for storage in a database. The Frontend Leaderboard subsequently accesses this server to retrieve the most recent models from the database, alongside their respective benchmarks and metadata. 

In contrast to the Open LLM leaderboard, this project draws inspiration from it but introduces standardized evaluation with common benchmarks due to computational resource limitations. Users can conduct evaluations on their GPUs, while the leaderboard acts as a centralized platform for model comparisons. 

To ensure reliability and consistency in the output, the company employed indictrans2 from AI4Bharat and other translation APIs to translate the benchmarking dataset into seven Indian languages.

In March, CognitiveLab introduced Ambari, an open-source Bilingual Kannada-English LLMs series. The initiative addresses the challenges posed by the dynamic landscape of LLMs, with a primary focus on bridging the linguistic gap between Kannada and English.

Share
Picture of Mohit Pandey

Mohit Pandey

Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words. He also holds a keen interest in photography, filmmaking, and the gaming industry.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India