Most offline reinforcement learning (RL) algorithms return a target poli...
Offline reinforcement learning (RL) struggles in environments with rich ...
Goal-conditioned reinforcement learning (RL) is a promising direction fo...
Learning to control an agent from data collected offline in a rich
pixel...
By providing unprecedented access to computational resources, cloud comp...
Most theoretically motivated work in the offline reinforcement learning
...
A central object of study in Reinforcement Learning (RL) is the Markovia...
Daniely and Schacham recently showed that gradient descent finds adversa...
Attention is a powerful component of modern neural networks across a wid...
We investigate the discounting mismatch in actor-critic algorithm
implem...
Denoising score matching with Annealed Langevin Sampling (DSM-ALS) is a
...
Attention is a powerful component of modern neural networks across a wid...
Our work is based on the hypothesis that a model-free agent whose
repres...
Adversarial learning has demonstrated good performance in the unsupervis...
We present a reduction from reinforcement learning (RL) to no-regret onl...
Previous work has shown the unreliability of existing algorithms in the ...
Batch Reinforcement Learning (Batch RL) consists in training a policy us...
Due to the ability of deep neural nets to learn rich representations, re...
Inspired by the phenomenon of catastrophic forgetting, we investigate th...
While a lot of progress has been made in recent years, the dynamics of
l...
While recent progress has spawned very powerful machine learning systems...
We propose a recurrent RL agent with an episodic exploration mechanism t...