We show two average-reward off-policy control algorithms, Differential Q...
We propose a new objective for option discovery that emphasizes the
comp...
In recent years, a growing number of deep model-based reinforcement lear...
The complementary fusion of light detection and ranging (LiDAR) data and...
We extend the options framework for temporal abstraction in reinforcemen...
In model-based reinforcement learning (MBRL), Wan et al. (2019) showed
c...
We consider off-policy policy evaluation with function approximation (FA...
Anomaly detection is a crucial and challenging subject that has been stu...
We introduce improved learning and planning algorithms for average-rewar...
The optimal policy of a reinforcement learning problem is often disconti...
Distribution and sample models are two popular model choices in model-ba...
In computer vision, the estimation of the fundamental matrix is a basic
...