Published on July 29, 2021
In Global Tech

Google Releases New Version of Translatotron: Its End-to-end Speech Translation Model

The Translatotron 2 model consists of a source speech encoder, a target phoneme decoder, and a target mel-spectrogram synthesiser.

By Avi Gopani

Google released Translatotron, an end-to-end speech-to-speech translation model, in 2019. The tech giant claimed the single sequence-to-sequence model is the first end-to-end framework to directly translate speech from one language into speech in another language. The system was used to create synthesised translations of voices to ensure the sound of the original speaker is intact. But this feature had the potential to be misused to generate speech in a different voice and create deep fake voices. This month, researchers at Google published a paper detailing ‘Translatotron 2’, an updated version that solves the deep fake problems. “The trained model is restricted to retain the source speaker’s voice, and unlike the original Translatotron, it is not able to generate spee

Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Avi Gopani

Avi Gopani is a technology journalist that seeks to analyse industry trends and developments from an interdisciplinary perspective at Analytics India Magazine. Her articles chronicle cultural, political and social stories that are curated with a focus on the evolving technologies of artificial intelligence and data analytics.

How can Federated learning be used for speech emotion recognition?

Major Announcements By Jensen Huang During NVIDIA GTC Keynote Speech

Why Speech-to-Speech Translation Is So Important For Google

Top Speech-To-Speech Translation Models & Tools In Market Today

Google Upgrades Translatotron, Its Speech-to-Speech Translation Model

A Tutorial on Spectral Feature Extraction for Audio Analytics

Don’t Miss the Next Big Shift in AI.

Get one year subscription for ₹5999

The AI Coding Gold Rush Ends Where Harness Begins

“Only 30% of software engineering happens on the laptop. The real 70% starts after you commit the code,” says Jyoti

How Gradient-Boosting is Quietly Powering India’s Research Push

From groundwater and slopes to carbon sinks, tools like CatBoost are enabling Indian scientists to extract insights and drive sustainability.

India’s Data Centre Boom Is Running Into a Talent Wall

With capacity expected to more than double this decade, the industry is investing in training as graduates struggle to meet

This Firm Wants to be the ‘Next Big Disruptor’ in Networking

Arrcus positions itself as a horizontal software layer that can run across different types of networking hardware.

Will 2026 be the year of AI IPOs?

With CoreWeave’s listing and Fractal Analytics going for an IPO, an array of AI companies are now looking to raise

Fighting Deepfakes May Not Be a Technology Problem

Defenders must be active at all times, while attackers need only one opportunity.

India’s Data Centre Expansion Is Decentralising

Without compute buildup beyond metros, the next wave of digital adoption will be constrained

How Mumbai Keeps Winning India’s Data-Centre Race

Land prices are among the highest in the country, but total build economics remain competitive by global standards.

Download the easiest way to
stay informed

Flagship Events

Google Releases New Version of Translatotron: Its End-to-end Speech Translation Model

Happy Llama 2026 The Must-Attend Summit for AI Startups Now in Bangalore and San Francisco