Last updated October 27, 2022
In AI Mysteries

What Happened in Reinforcement Learning in 2022

From robots playing football to learning how to walk on the moon!

Share

Published on October 25, 2022

by Tasmia Ansari

Listen to this story

Just like how we learn from our environment and our actions determine whether we are rewarded or punished, so do reinforcement learning agents whose ultimate aim is to maximise the rewards.

This article brings the top 8 reinforcement learning innovations that shaped AI across several industries in 2022.

Ithaca – AI model to decipher ancient text

Alphabet’s DeepMind collaborated with the University of Venice, the University of Oxford and the Athens University of Economics and Business to build a deep neural network called ‘Ithaca’, which can restore missing text from ancient texts.

In a paper published in Nature, DeepMind stated that Ithaca was trained using natural language processing (NLP) to not only recover lost ancient text that has been damaged over time but also identify the original location of the text and establish the date when it was made.

For more information, click here.

AlphaTensor – Fastest method to multiply matrices

With DeepMind’s latest release AlphaTensor, an AI system (based on a 3D board game), researchers shed light on a 50-year-old fundamental mathematics question of finding the fastest way to multiply two matrices.

To play the game, the researchers trained a new version of AlphaZero, called ‘AlphaTensor’. Instead of learning the best moves to make in ‘Go’ or chess, the system learned the best steps to make when multiplying matrices. Then, using DeepMind’s favourite reinforcement learning, the system was rewarded for winning the game in as few moves as possible.

For more information, click here.

Architecture for tokamak magnetic controller design

Google’s DeepMind AI team collaborated with physicists from the Swiss Plasma Centre at EPFL in Ecublens, Switzerland, to develop an AI method to control the plasmas inside a nuclear fusion reactor.

The study helps further nuclear fusion research and could also help quicken the arrival of a cheaper, cleaner, and unlimited source of energy.

For more information, click here.

Human-level Atari 200x Faster

In the new paper ‘Human-level Atari 200x Faster’, a DeepMind research team applies diverse strategies to Agent57, with their resulting MEME (Efficient Memory-based Exploration) agent surpassing the human baseline on all 57 Atari games in just 390 million frames—two orders of magnitude faster than Agent57.

For more information, click here.

LEAP (Legged Exploration of the Aristarchus Plateau)

Just like Apollo astronauts, a four-legged robot trained through AI learned that jumping is the best way to move around on the moon’s surface.

An update on LEAP, a mission concept study to explore some of the most challenging lunar terrains, was presented in September at the Europlanet Science Congress (EPSC) 2022.

The robot has been trained using reinforcement learning in a virtual environment to simulate the lunar ground, dust properties as well as gravity.

For more information, click here.

InstructGPT

OpenAI used reinforcement learning from human intervention and feedback fine-tuned GPT-3. As a result, the new model, ‘InstructGPT’, is extremely good at generating text from single-sentence prompts.

(Source: OpenAI Blog)

For more information, click here.

MIT’s mini cheetah robot

MIT researchers detail how they taught a mini cheetah robot to play goalie in a soccer match through reinforcement learning.

According to the researchers, the proposed framework can be extended to other scenarios. The authors explained, “Soccer goalkeeping using quadrupeds combines highly dynamic locomotion with precise and fast non-prehensile object manipulation. The robot needs to react and intercept a flying ball using dynamic locomotion manoeuvres in a very short amount of time, usually less than one second”.

For more information, click here.

Sparrow – DeepMind’s Chatbot

To fill the communication gap between man and machine, DeepMind recently released its new AI chatbot ‘Sparrow’, a “useful dialogue agent that reduces the risk of unsafe and inappropriate answers”.

As per the subsidiary of Google’s parent company, Alphabet, the chatbot is designed to “talk, answer questions and look up evidence using Google when it’s helpful to inform its responses”.

For more information, click here.

Access all our open Survey & Awards Nomination forms in one place

Share

Tasmia Ansari

Tasmia is a tech journalist at AIM, looking to bring a fresh perspective to emerging technologies and trends in data science, analytics, and artificial intelligence.

Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Why NVIDIA is Acquiring Run:ai

Why NVIDIA is Acquiring Run:ai

Mohit Pandey

Run ai specialises in enabling enterprise customers manage and optimise their compute infrastructure efficiently

CTO Kailash Nadh Zerodha

Zerodha CTO Says He Stopped Googling Technical Stuff Over the Past Year

Vandana Nair

Smartphones Will Soon be Dead

Smartphones Will Soon be Dead

Vidyashree Srinivas

Top Editorial Picks

GitHub Copilot Rival, Augment Secures $252 Mn at $1 Bn Valuation to Boost AI for Developers

K L Krithika

Synology Launches Advanced Data Management & Security Solutions Against Ransomware in India

Pritam Bordoloi

PyTorch Releases Version 2.3 with Focus on Large Language Models and Sparse Inference

K L Krithika

Healthtech AI startup Endimension Technology raises INR 6 Crore in Pre-Series A Round

Pritam Bordoloi

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Also in News

Adobe Unveils World’s First Large-Scale GAN-based Model for Video Super-Resolution

Snowflake Arctic

Snowflake Releases Open Enterprise LLM, Arctic with 480 Billion Parameters

Now Run Programs in Real Time with Llama 3 on Groq

Guardians of the Syntax: Securing Enterprise LLM Systems against Emerging Threats

Fibe Leverages Amazon Bedrock to Increase Customer Support Efficiency by 30%

This 18-Year-Old Programmer is Creating an Open Source Alternative to Redis

This 18-Year-Old Programmer is Creating an Open Source Alternative to Redis

US India Investments

India will Need at least $200-300 Mn to Build GPT-5-level AI Model

Doctors Use Apple Vision Pro to Enhance Shoulder Arthroscopy Surgery

Doctors Use Apple Vision Pro to Enhance Shoulder Arthroscopy Surgery

AI Courses & Careers

View All

India is a Goldmine for AI Talent

Donna Eva 15/04/2024

Top 10 LMS Platforms for Enterprise AI Training and Development

Analytics India Magazine 14/04/2024

AI Clock is Ticking: Wake Up Call for Education Institutions

Siddharth Jindal 18/09/2023

Become a Certified Generative AI Engineer

Industry
Insights

View All

New Relic Enhances AI Monitoring, Industry’s First APM for AI

Pritam Bordoloi 25/04/2024

BCG Predicts AI to Drive 20% of 2024 Revenues, Doubling to 40% by 2026

Shritama Saha 24/04/2024

AI Can Now Edit DNA of Human Cells

Gopika Raj 23/04/2024

Check our Industry Research Reports

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.

AIM Videos

What is Computer Vision and How it Works?

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

GenAI
Corner

View All

7 AI Startups that Featured on Shark Tank India Season 3

Siddharth Jindal 15/04/2024

Top 9 Semiconductor GCCs in India

Shyam Nandan Upadhyay 15/04/2024

Top 6 Devin Alternatives to Automate Your Coding Tasks

Siddharth Jindal 08/04/2024

10 Free AI Courses by NVIDIA

Shritama Saha 02/04/2024

Top 6 AI/ML Hackathons to Participate in 2024

Siddharth Jindal 22/03/2024

What’s Devin Up to?

K L Krithika 17/03/2024

10 Underrated Women in AI to Watchout For

K L Krithika 11/03/2024

10 AI Startups Run by Incredible Women Entrepreneurs

K L Krithika 08/03/2024

Data
Dialogues

View All

Automation Anywhere Wants to Augment Humans with AI, Not Replace Them

Shritama Saha 18/04/2024

Father of Computational Theory Wins 2023 Turing Award

Shritama Saha 13/04/2024

Falcon- TII- UAE

Building Open Source LLMs is Not for Everyone

Vandana Nair 12/04/2024

This 20-year-old AI Researcher Created the much-needed Indic LLM Leaderboard

This 20-year-old AI Researcher Created the much-needed Indic LLM Leaderboard

Mohit Pandey 10/04/2024

NPCI is Exploring AI-Powered Futuristic Payment Frontiers: CTO

Pritam Bordoloi 08/04/2024

Prisma AI

Prisma AI Has an ‘Eye on You’ at Adani Airports

Vandana Nair 06/04/2024

Salesforce Chief Ethicist Deems Doomsday AI Discussions a ‘Waste of Time’

Pritam Bordoloi 01/04/2024

ManageEngine Zoho

Zoho’s ManageEngine Invests $10 Mn in NVIDIA, Intel, and AMD GPUs

Vandana Nair 30/03/2024

Future
Talks

View All

ai jobs india

T-Hub Supported MATH is Launching AI Career Finder to Create AI Jobs

Pritam Bordoloi 23/04/2024

Quora’s Poe Eats Google’s Lunch

Gopika Raj 17/04/2024

Zoho teams up with Intel for optimizing video AI workloads

Zoho Collaborates with Intel to Optimise & Accelerate Video AI Workloads

Gopika Raj 08/04/2024

Rakuten Certified as Best Firm for Data Scientists for the 2nd Time

Analytics India Magazine 08/04/2024

bulls.ai

This Indian Logistics Company Developed an LLM to Enhance Last-Mile Delivery

Pritam Bordoloi 02/04/2024

Perplexity AI

Perplexity AI Reviews with Pro Access

Vandana Nair 02/04/2024

Apple WWDC 2024

What to Expect at the ‘Absolutely Incredible’ Apple WWDC 2024

Vandana Nair 31/03/2024

Code Generator

Will StarCoder 2 Win Over Enterprises?

Pritam Bordoloi 20/03/2024

Developer’s Corner

Japan is the Next Big Hub for Indian Tech Talent

Siddharth Jindal 22/04/2024

Will TypeScript Wipe Out JavaScript?

K L Krithika 21/04/2024

Meta Llama 3

Meta Forces Developers Cite ‘Llama 3’ in their AI Development

Sukriti Gupta 19/04/2024

Why Developers Hate Jira

Why Developers Hate Jira

Mohit Pandey 01/04/2024

In Case You Missed It

Which is the Most Frustrating Programming Language?

Which is the Most Frustrating Programming Language?

Mohit Pandey 18/03/2024

AI4Bharat Rolls Out IndicLLMSuite for Building LLMs in Indian Languages

Shritama Saha 15/03/2024

Google Introduces Synth^2 to Enhance the Training of Visual Language Models

K L Krithika 14/03/2024

Infosys Funds Llama 2 Project with 22 Indian Languages

Infosys Founder Funds Meta’s Llama 2 Project with 22 Indian Languages

Mohit Pandey 13/03/2024

Webstories

Excel tools

9 Best AI Tools for Excel and Google Spread Sheet Automation

Generative AI Certification Courses

8 Best Generative AI Courses for Executives and Managers

Add ChatGPT Chrome Extension Right Away

Top 8 AI Browser Extensions for Chrome Users in 2024

Dead Programming Languages

Top 5 Devin AI Alternatives for Coders and Developers

Programming language concept. System engineering. Software development.

10 Best AI Code Generator Tools to Use for Free in 2024

STAR Framework for Measuring AI Trust: Safety, Transparency, Accountability and Responsibility

What are the Responsibility of Developers Using Generative AI

Also in Trends

GitHub Secures Millions of Developers Through Two-Factor Authentication

90% of Indian Internet Users are already using AI, says Report

90% of Indian Internet Users are already using AI, says Report

Jensen Huang Personally Delivers First NVIDIA DGX H200 to OpenAI

Cognition Labs Devin funding

Six Months Old Cognition Labs Raises $175 Mn from Founders Fund at $2 Bn Valuation

apple

Apple Releases Four Open Source LLMs with OpenELM Series of Models

Adobe Launches Firefly Image 3 Beta With Auto Stylisation, Structure Reference Capabilities

C.P. Gurnani & InterGlobe’s Rahul Bhatia Announce AI Business Venture AIonOS

How Good is Llama 3 for Indic Languages?

AWS Brings Meta’s Llama 3 Models on Amazon Bedrock

World's Biggest Media & Analyst firm specializing in AI

Advertise with us

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

Branded Content

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

Corporate Upskilling

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

Hackathons

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Talent Assessment

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

Research & Advisory

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Conferences & Events

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

AIM Launches the 3rd Edition of Data Engineering Summit. May 30-31, Bengaluru

Join the forefront of data innovation at the Data Engineering Summit 2024 where industry leaders redefine technology 8217 s future

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024