OpenAI’s neural theorem prover can solve Math Olympiad problems

The theorem prover achieved 41.2% vs 29.3% on the miniF2F benchmark, a challenging collection of high-school olympiad problems.

OpenAI has built a neural theorem prover for Lean that was able to solve a range of problems from AMC12 and AIME competitions as well as problems from the IMO.

The prover uses a language model to find proofs of formal statements. Whenever a new proof is found, the prover uses this as training data which improves the neural network and enables it to constantly find solutions to more complex problems.

Formal mathematics involves two major challenges that make a simple application of reinforcement learning unlikely to succeed: Infinite action space and Lack of self-play.


Sign up for your weekly dose of what's up in emerging technology.

To overcome the infinite action space challenge, the natural theorem prover uses sampling actions from a language model as it searches for proof. Language models have the capacity to generate tactic calls and mathematical terms often required as arguments. To address the lack of self-play, the creators replaced unsupervised curriculum with an auxiliary set of problem statements of varying difficulty. The training procedure was able to solve a curriculum of increasingly difficult problems when provided with a varied set of difficulties in auxiliary problems.

Although, the neural theorem prover was able to generate exciting results proving the potential deep learning models capability of non-trivial mathematical reasoning when interacting with a formal system. But, the system is still far away from providing consistent results when compared with the performance of the best students who participate in these challenging olympiad problems.

Download our Mobile App

More Great AIM Stories

SharathKumar Nair
Sharath is an ardent believer in the ‘Transhumanism’ movement. Anything and everything about technology excites him. At Analytics India Magazine, he writes about artificial intelligence, cybersecurity and the impact these emerging technologies have on day-to-day human lives. When not working on a story, he spends his time reading tech novels and watching sci-fi movies and series.

AIM Upcoming Events

Regular Passes expire on 3rd Mar

Conference, in-person (Bangalore)
Rising 2023 | Women in Tech Conference
16-17th Mar, 2023

Early Bird Passes expire on 17th Feb

Conference, in-person (Bangalore)
Data Engineering Summit (DES) 2023
27-28th Apr, 2023

Conference, Virtual
Deep Learning DevCon 2023
27 May, 2023

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox