Fusion Is The Future: OpenAI Co-founder Bets On Language Models In A Visual World

“In 2021, language models will start to become aware of the visual world.”Ilya Sutskever, co-founder, OpenAI For many years, within the realms of AI, there has been a lot of talk about Artificial General Intelligence or AGI -- building algorithms that can learn on the go and simulate human cognition. However, the elusive human brain remains too complex to clone. So, AI researchers started focusing on winning small by taming specific skills. Rules were written, models were tweaked. Today, we have computer vision models that can detect faces in the group and in the dark, and language models that can write prose. But, these are two separate skills--vision and speech. And now, there is a major push for neural networks that can do both. These objectives were further reiterated in Ilya Sutskever’s recent interview on Andrew Ng’s The Batch.  “ If you can expose models to data similar to those absorbed by humans, they should learn concepts in a way that’s more similar to
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Picture of Ram Sagar
Ram Sagar
I have a master's degree in Robotics and I write about machine learning advancements.
Related Posts
AIM Print and TV
Don’t Miss the Next Big Shift in AI.
Get one year subscription for ₹5999
Download the easiest way to
stay informed