AIM Banners_978 x 90

Beyond the Surface: The Role of Diffusion in Language Models

Diffusion models became the paradigm for generative models in 2022. Unlike autoregressive models that require restricted connectivity patterns for ensuring causality, diffusion models are unconstrained and thus allow more creative freedom. But given the efficiency of current autoregressive models (example ChatGPT), is researching diffusion models in the language field even worth it?
Diffusion Models were introduced in the field of AI in 2015. Jascha Sohl-Dickstein, finding inspiration from the physics principles of non-equilibrium thermodynamics, developed the technique for generating images that outperformed GANs in terms of quality and speed. This gave rise to the successful text-to-image models like DALL.E2 and Stable Diffusion. Sander Dieleman, Research Scientist at DeepMind, recently published a blog exploring if diffusion models can also be used for language tasks. He perfectly iterates the part about how we proceed with language in real life—starting with a basic concept in mind about a topic and then writing words, phrasing, and structuring comes later. This concept, interestingly, looks similar to how diffusion models work and the approach can definitely
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Picture of Mohit Pandey
Mohit Pandey
Mohit writes about AI in simple, explainable, and often funny words. He's especially passionate about chatting with those building AI for Bharat, with the occasional detour into AGI.
Related Posts
AIM Print and TV
Don’t Miss the Next Big Shift in AI.
Get one year subscription for ₹5999
Download the easiest way to
stay informed