Published on July 10, 2020
In Deep Tech

Duality: A New Approach to Reinforcement Learning

By Ram Sagar

The environment and its underlying dynamics of all the reinforcement learning problems are typically abstracted as a Markov decision process (MDP). Because MDPs are useful for studying optimisation problems solved via dynamic programming and reinforcement learning. Today, most of the effective existing reinforcement learning (RL) algorithms are rooted in this dynamic programming paradigm. An alternative paradigm for RL is based on linear programming (LP). The researchers at Google, try to generalise this LP approach and have tried to demonstrate how relevant it is to RL. A Brief Of Overview Of Duality Duality or the duality principle is associated with the optimisation theory, which posits that optimisation problems can be perceived as the primal problem or the dual problem.

Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Ram Sagar

I have a master's degree in Robotics and I write about machine learning advancements.

Why Everyone is Rushing to Build Reinforcement Learning Environments

Former Google DeepMind Researchers Go Deep for Sales Triumph

DeepMind Wants to Take Humans Out of RLHF

Who Will Win the AGI Race?

Google Introduces Offline Reinforcement Learning to Train AI Agents

Top Reinforcement Learning Algorithms

Don’t Miss the Next Big Shift in AI.

Get one year subscription for ₹5999

Top 10 Companies That Crowned Hyderabad as India’s Greenfield GCC Leader in 2025

Telangana has attracted over 75 greenfield GCCs in 2025, compared with 40-plus in Karnataka.

The AI Coding Gold Rush Ends Where Harness Begins

“Only 30% of software engineering happens on the laptop. The real 70% starts after you commit the code,” says Jyoti

How Gradient-Boosting is Quietly Powering India’s Research Push

From groundwater and slopes to carbon sinks, tools like CatBoost are enabling Indian scientists to extract insights and drive sustainability.

India’s Data Centre Boom Is Running Into a Talent Wall

With capacity expected to more than double this decade, the industry is investing in training as graduates struggle to meet

This Firm Wants to be the ‘Next Big Disruptor’ in Networking

Arrcus positions itself as a horizontal software layer that can run across different types of networking hardware.

Will 2026 be the year of AI IPOs?

With CoreWeave’s listing and Fractal Analytics going for an IPO, an array of AI companies are now looking to raise

Fighting Deepfakes May Not Be a Technology Problem

Defenders must be active at all times, while attackers need only one opportunity.

India’s Data Centre Expansion Is Decentralising

Without compute buildup beyond metros, the next wave of digital adoption will be constrained

Download the easiest way to
stay informed

Flagship Events

Duality: A New Approach to Reinforcement Learning

Happy Llama 2026 The Must-Attend Summit for AI Startups Now in Bangalore and San Francisco