Published on December 13, 2022
In AI News

Reinforcement Learning Rant Continues

Uploaded: 2022-12-13T14:34:47+05:30
Channel: Mohit Pandey

Yann LeCun said that though RL is inevitable in machine learning, the purpose behind incorporating it in algorithms should be to eventually minimise its use.

By Mohit Pandey

Weeks after criticising reinforcement learning (RL), Meta AI chief Yann LeCun went on Twitter again to express his views and remind everyone how he had predicted the reach and impact of reinforcement learning. LeCun has been a big proponent of self-supervised learning (SSL) and also said that innovations using SSL have been working better than he anticipated.

The impact of RL has been exactly as small as I had predicted.
The impact of Self-Supervised Learning has been even bigger that I had predicted. https://t.co/liZQQeanLn
— Yann LeCun (@ylecun) December 12, 2022

The discussion started when Dan Becker, former Google data scientist and founder of Decision AI, tweeted that after AlphaGo beat Lee Sedol, a Go professional from South Korea, many researchers started to believe that reinforcement learning would be a game changer. Its impact now seems smaller, he said, because of its limitations in real-world problems.

During an exclusive interaction with Analytics India Magazine, LeCun said that though reinforcement learning can be effective, it requires multiple trials. The machine has to learn from the responses that it receives from the world, and this proves to be highly inefficient and unreliable.

LeCun added that though RL is inevitable in machine learning, the purpose behind incorporating it in algorithms should be to eventually minimise its use. Sebastian Raschka, an AI researcher, agreed with LeCun and said that for large language models to work, self-supervised learning is a must.

Further in the discussion, LeCun said that even ChatGPT uses SSL more than RL, but that there are only two obstacles: defining explicit objectives and planning abilities.

This is not the first time that LeCun has criticised reinforcement learning. Recently, while making the case for SSL at NeurIPS 2022, he suggested that researchers abandon reinforcement learning to further the development of AI.
