Transformers were originally proposed as a sequence-to-sequence model fo...
We present an algorithm for local, regularized, policy improvement in re...
Offline methods for reinforcement learning have the potential to help br...
Robust Markov Decision Processes (RMDPs) intend to ensure robustness wit...
Reinforcement learning (RL) has proven its worth in a series of artifici...
The ability to transfer skills across tasks has the potential to scale u...
The ability of a reinforcement learning (RL) agent to learn about many r...