Baidu Launches World’s Largest Dialogue Generation Model With 11 Billion Parameters

PLATO-XL is trained on a high-performance GPU cluster with 256 NVIDIA Tesla V100 32G GPU cards.
Earlier this week, the Chinese internet giant Baidu released PLATO-XL, a pre-trained dialogue generation model with up to 11 billion parameters. It adopts the architecture of a unified transformer with high computation and parameter efficiency.

PLATO-XL carries out multi-party aware pre-training to better distinguish the characteristic information in social media conversation. As a result, it achieves superior performance compared to other approaches in both English and Chinese. Furthermore, PLATO-XL has effectively reduced the inconsistency phenomenon in multi-turn conversations, thanks to the multi-party aware pre-training.

Soon, the company plans to release the source code on GitHub. "We will release our source code together with the English model at GitHub, hoping to facilitate frontier research in dialogue generation," said Baidu researchers.

https://twitter.com/BaiduResearch/status/1442558874523824129

Language Models Vs Dialogue Generation Models

Amit Naik
Amit Raja Naik is Senior Editorial Producer – Live Shows at AIM Network, driving India’s most influential AI and technology conversations. He leads content, narrative design, and visual storytelling, engaging with leaders, innovators, and policymakers to advance how technology impacts businesses, governance, and society.