Hugging Face integrates Decision Transformers into transformers library

The Decision Transformer model abstracts reinforcement learning as a conditional-sequence modelling problem.

Hugging Face has integrated the Decision Transformer, an offline reinforcement learning method, into the Hugging Face transformers library and the Hugging Face hub. Hugging Face plans to improve accessibility in the field of deep RL and looks forward to sharing them with users over the coming weeks.

The Decision Transformer model abstracts reinforcement learning as a conditional-sequence modelling problem. The main idea is that instead of training a policy using RL methods, such as fitting a value function, that will tell us what action to take to maximise the return (cumulative reward), Hugging Face uses a sequence modelling algorithm (Transformer) that, given the desired return, past states, and actions, will generate future actions to achieve this desired return. It’s an autoregressive model conditioned on the desired return, past states, and actions to generate future actions that achieve the desired return.
This is a complete shift in the reinforcement learning paradigm since they use generative trajectory modelling (modelling the joint distribution of the sequence of states, actions, and rewards) to replace conventional RL algorithms.


Sign up for your weekly dose of what's up in emerging technology.

More Great AIM Stories

Kartik Wali
A writer by passion, Kartik strives to get a deep understanding of AI, Data analytics and its implementation on all walks of life. As a Senior Technology Journalist, Kartik looks forward to writing about the latest technological trends that transform the way of life!

Our Upcoming Events

Masterclass, Virtual
How to achieve real-time AI inference on your CPU
7th Jul

Masterclass, Virtual
How to power applications for the data-driven economy
20th Jul

Conference, in-person (Bangalore)
Cypher 2022
21-23rd Sep

Conference, Virtual
Deep Learning DevCon 2022
29th Oct

3 Ways to Join our Community

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Telegram Channel

Discover special offers, top stories, upcoming events, and more.

Subscribe to our newsletter

Get the latest updates from AIM

What can SEBI learn from casinos?

It is said that casino AI technology comes with superior risk management systems compared to traditional data analytics that regulators are currently using.

Will Tesla Make (it) in India?

Tesla has struggled with optimising their production because Musk has been intent on manufacturing all the car’s parts independent of other suppliers since 2017.