A long-term goal of reinforcement learning agents is to be able to perfo...
Many currently deployed Reinforcement Learning agents work in an environ...
Potential-based reward shaping (PBRS) is an effective and popular techni...
Recent advances of gradient temporal-difference methods allow to learn
o...