The feedback that users provide through their choices (e.g., clicks,
pur...
Reinforcement learning (RL) has emerged as a powerful paradigm for
fine-...
Users interact with text, image, code, or other editors on a daily basis...
We present lilGym, a new benchmark for language-conditioned reinforcemen...
Successor-style representations have many advantages for reinforcement
l...
We propose an algorithm for tabular episodic reinforcement learning with...
Imitation learning algorithms provide state-of-the-art results on many
s...
In standard reinforcement learning (RL), a learning agent seeks to optim...
Standard sequential generation methods assume a pre-specified generation...
We describe the University of Maryland machine translation systems submi...