Published on December 18, 2024
In AI Features

The Rise of Reasoning Models

Name: The Rise of Reasoning Models
Uploaded: 2024-12-18T18:00:00+05:30
Channel: Sagar Sharma
Description: One of the possible reasons behind the recent surge in reasoning models could be the reasoning datasets.

Image by Nalini Nirad

By Sagar Sharma

‘How many R’s does the word Strawberry have?’ is a question many of us have put to LLMs. When OpenAI revealed the o1 series of models, we finally got the right answer. From there, we witnessed a shift that put “PhD-level intelligence” in almost everyone’s hands. In a recent podcast, Diana Hu, general partner at Y Combinator, said that the rise of reasoning models can be traced back to OpenAI’s early work on Dota, where the company applied reinforcement learning techniques inspired by AlphaGo and AlphaZero. The o1 models did something different under the hood: they used chain-of-thought (CoT) and reasoning tokens to answer complex questions that earlier models could not handle. Among the technical breakthroughs behind reasoning models, one of the most significant pieces of research was Reflectio

Sagar Sharma

A software engineer who loves to experiment with new-gen AI. He also loves testing hardware, which sometimes crashes. While he is reviving a crashed system, you can find him reading literature or manga, or watering plants.