Indian Startup Sarvam AI Launches Hindi LLM, OpenHathi 

The model demonstrates robust performance across various Hindi tasks, comparable to, if not surpassing, GPT-3.5.
Indian AI startup Sarvam AI has released OpenHathi-Hi-v0.1, the first Hindi LLM in the OpenHathi series. Developed on a budget-friendly platform, the model, an extension of Llama2-7B, boasts GPT-3.5-like performance for Indic languages. OpenHathi, featuring a 48K-token extension of Llama2-7B’s tokenizer, underwent a two-phase training process. The first phase focused on embedding alignment, where Hindi embeddings were randomly initialized and aligned, followed by bilingual language modelling to teach the model cross-lingual attention across tokens. https://youtu.be/WKfVzJSDAd8 The model demonstrates robust performance across various Hindi tasks, comparable to, if not surpassing, GPT-3.5, while maintaining English proficiency. Sarvam AI's evaluation includes non-academic, real-
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Picture of Siddharth Jindal
Siddharth Jindal
Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Related Posts
AIM Print and TV
Don’t Miss the Next Big Shift in AI.
Get one year subscription for ₹5999
Download the easiest way to
stay informed