Imitation learning (IL) algorithms often rely on inverse reinforcement
l...
Neural networks (NNs) playing the role of controllers have demonstrated
...
A misspecified reward can degrade sample efficiency and induce undesired...
Reward design is a fundamental problem in reinforcement learning (RL). A...
We study the problem of policy repair for learning-based control policie...
Apprenticeship learning (AL) is a class of "learning from demonstrations...