Imitation learning (IL) is a simple and powerful way to use high-quality...
We demonstrate the first large-scale application of model-based generati...
Conventionally, generation of natural language for dialogue agents may b...
Goal-oriented dialogue systems face a trade-off between fluent language
...
Off-policy evaluation (OPE) holds the promise of being able to leverage
...
In this work we consider data-driven optimization problems where one mus...
In this tutorial article, we aim to provide the reader with the conceptu...
The offline reinforcement learning (RL) problem, also referred to as bat...
The offline reinforcement learning (RL) problem, also referred to as bat...
Imitation learning algorithms provide a simple and straightforward appro...
Designing effective model-based reinforcement learning algorithms is
dif...
Off-policy reinforcement learning aims to leverage experience collected ...
Q-learning methods represent a commonly used class of algorithms in
rein...
Reinforcement learning is a promising framework for solving control prob...
The design of a reward function often poses a major practical challenge ...
Deep reinforcement learning (RL) can acquire complex behaviors from low-...