Google has launched T5Gemma, a new family of encoder-decoder large language models (LLMs) built on Gemma 2 that promises improved quality and inference efficiency over decoder-only counterparts.
Bucking the current trend towards decoder-only LLMs, T5Gemma revisits the classic encoder-decoder architecture used in models such as T5. The company introduced an adaptation technique that converts pretrained decoder-only models into encoder-decoder ones.
“We study a novel problem: adapting pretrained decoder-only LLMs to encoder-decoder, with the goal of leveraging the strengths of both approaches to achieve a more favourable quality-efficiency trade-off,” the research paper mentioned.
The researchers further highlight that this adaptation lets the new models inherit the capabilities of decoder-only LLMs while demanding less compute than pretraining an encoder-decoder model from scratch.
“Can we build top-tier encoder-decoder models based on pretrained decoder-only models? We answer this question by exploring model adaptation,” the company explained in the blog post.
T5Gemma includes both newly trained T5-sized models, ranging from small to XL, and adapted Gemma 2 models with 2B and 9B parameters. It also offers unbalanced combinations, such as a 9B encoder paired with a 2B decoder, aimed at tasks where understanding the input matters more than the complexity of the output.
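The appeal of an unbalanced pairing comes from how encoder-decoder inference works: the encoder processes the input once, while the decoder runs at every generated token. The toy sketch below (plain NumPy, not T5Gemma's actual architecture; the layer counts and dimensions are illustrative stand-ins) shows why a heavy encoder adds a one-off cost while per-token generation cost is set by the small decoder.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_stack(d, n_layers):
    """Stack of toy dense layers standing in for transformer blocks."""
    return [rng.standard_normal((d, d)) * 0.01 for _ in range(n_layers)]

def run(layers, x):
    for w in layers:
        x = np.tanh(x @ w)
    return x

def param_count(layers):
    return sum(w.size for w in layers)

d = 64
big_encoder = make_stack(d, n_layers=12)   # stand-in for a large (e.g. 9B) encoder
small_decoder = make_stack(d, n_layers=3)  # stand-in for a small (e.g. 2B) decoder

prompt = rng.standard_normal((16, d))      # 16 input tokens
memory = run(big_encoder, prompt)          # encoder cost is paid exactly once

# Generation loop: only the small decoder is invoked per output token,
# so per-token latency tracks the decoder's size, not the encoder's.
token = memory.mean(axis=0)
for _ in range(8):
    token = run(small_decoder, token)
```

A 9B-encoder/2B-decoder combination thus buys deep input understanding up front while keeping the per-token generation cost of a 2B model.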
According to benchmark results shared by Google, T5Gemma dominates the quality-inference efficiency Pareto frontier. On SuperGLUE and GSM8K, the models outperform comparable decoder-only models in both accuracy and latency. For example, T5Gemma 9B-9B delivered higher GSM8K accuracy than Gemma 2 9B while maintaining similar latency.
The gains extend beyond pretraining. After instruction tuning, T5Gemma models showed dramatic improvements. The 2B-2B model’s MMLU score jumped 12 points, while GSM8K accuracy rose from 58.0% to 70.7%, highlighting the architecture’s responsiveness to fine-tuning.
Google has released a wide range of T5Gemma checkpoints, including pretrained and instruction-tuned variants, with multiple training objectives such as PrefixLM and UL2.
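PrefixLM, one of the objectives mentioned, differs from standard causal language modelling in its attention mask: the input prefix attends bidirectionally while the target continuation remains causal. The snippet below is a minimal illustration of that mask (a generic sketch, not code from T5Gemma or any specific library).

```python
import numpy as np

def causal_mask(n):
    """Standard decoder-only mask: position i attends only to positions <= i."""
    return np.tril(np.ones((n, n), dtype=bool))

def prefix_lm_mask(n, prefix_len):
    """PrefixLM mask: tokens within the prefix (the input) see each other
    bidirectionally; tokens after the prefix (the target) stay causal."""
    mask = causal_mask(n)
    mask[:prefix_len, :prefix_len] = True  # full attention inside the prefix
    return mask

m = prefix_lm_mask(6, prefix_len=3)
```

With `prefix_len=3`, position 0 can attend to position 2 (bidirectional prefix), but position 3 still cannot attend to position 4 (causal target).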
The models are now available on Hugging Face, Kaggle, and Vertex AI for further experimentation and deployment.



