Google Challenges Meta’s Llama 2 with Lightweight Open Source LLM, Gemma

Gemma outperforms Llama 2 on several benchmarks, including MMLU, HellaSwag, and HumanEval.

Google has unveiled Gemma, a new family of open models, leveraging the research and technology behind the existing Gemini models. The Gemma open models are released in two sizes, Gemma 2B and Gemma 7B, each offering pre-trained and instruction-tuned variants. 

Users can start working with Gemma today through free access on Kaggle and a free tier for Colab notebooks. Additionally, first-time Google Cloud users can claim $300 in credits, and researchers can apply for Google Cloud credits of up to $500,000 to accelerate their projects.

To support developer innovation and responsible use, Google is also providing a Responsible Generative AI Toolkit alongside the models. This toolkit includes essential tools for creating safer AI applications with Gemma, offering guidance and support for developers.

To facilitate widespread adoption, Gemma is compatible with major frameworks, including JAX, PyTorch, and TensorFlow, through native Keras 3.0. The release also includes ready-to-use Colab and Kaggle notebooks, along with integration with popular tools such as Hugging Face, MaxText, NVIDIA NeMo, and TensorRT-LLM.
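As an illustration of the Hugging Face integration, the following is a minimal sketch of how one of the four Gemma variants might be loaded and prompted through the Transformers library. It rests on several assumptions not stated in the article: that the checkpoints are published on the Hugging Face Hub as `google/gemma-2b` and `google/gemma-7b` with an `-it` suffix for the instruction-tuned variants, that a recent `transformers` release with Gemma support and PyTorch are installed, and that the Gemma licence has been accepted on the Hub with an authenticated session. The helper names `gemma_model_id` and `generate` are hypothetical and exist only for this example.

```python
"""Sketch: loading a Gemma checkpoint through Hugging Face Transformers."""

# Assumption: Hub IDs follow the "google/gemma-<size>" scheme, with "-it"
# appended for instruction-tuned variants. The article does not state these IDs.
VALID_SIZES = ("2b", "7b")


def gemma_model_id(size: str, instruction_tuned: bool = False) -> str:
    """Build the assumed Hub model ID for a Gemma variant (hypothetical helper)."""
    size = size.lower()
    if size not in VALID_SIZES:
        raise ValueError(f"unknown Gemma size {size!r}; expected one of {VALID_SIZES}")
    suffix = "-it" if instruction_tuned else ""
    return f"google/gemma-{size}{suffix}"


def generate(prompt: str, size: str = "2b", instruction_tuned: bool = True,
             max_new_tokens: int = 64) -> str:
    """Download the chosen variant and return a completion for `prompt`."""
    # Imported lazily so gemma_model_id() works without these packages installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = gemma_model_id(size, instruction_tuned)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenise the prompt into PyTorch tensors and generate a continuation.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Explain what an open model is.")` would download the 2B instruction-tuned weights on first use, so the 2B variant is the more practical choice on a laptop or a free Colab tier; passing `size="7b"` selects the larger model.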

Gemma models can run on various platforms, from laptops and workstations to Google Cloud, with optimisation for industry-leading performance on NVIDIA GPUs and Google Cloud TPUs. 

This development comes shortly after Google introduced Gemini 1.5 with a context window of 1 million tokens, which Google describes as the longest of any large-scale foundation model to date. By comparison, GPT-4 Turbo has a 128K-token context window, and Claude 2.1 has a 200K-token context window.

Siddharth Jindal

Siddharth is a media graduate who loves exploring tech through journalism and putting forward ideas worth pondering in the era of artificial intelligence.