We extend the options framework for temporal abstraction in reinforcemen...
In model-based reinforcement learning (MBRL), Wan et al. (2019) showed
c...
We introduce improved learning and planning algorithms for average-rewar...
Discounted reinforcement learning is fundamentally incompatible with fun...
Imitation learning algorithms learn viable policies by imitating an expe...