In this paper, we introduce a novel form of value function, Q(s, s'), th...
Imitation by observation is an approach for learning from expert
demonst...
We describe a novel approach to imitation learning that infers latent
po...
Goals for reinforcement learning problems are typically defined through
...
A major bottleneck for developing general reinforcement learning agents ...
In reinforcement learning, we often define goals by specifying rewards w...