In the past decade, model-free reinforcement learning (RL) has provided
...
Recent policy optimization approaches have achieved substantial empirica...
Recent policy optimization approaches (Schulman et al., 2015a, 2017) hav...
We propose a novel framework for multi-task reinforcement learning (MTRL...
Though successful in high-dimensional domains, deep reinforcement learni...
Hierarchical learning (HL) is key to solving complex sequential decision...
A temporally abstract action, or an option, is specified by a policy and...
Many real-world reinforcement learning problems have a hierarchical natu...
Motivated by the increasing integration among electricity markets, in th...
Congestion problems are omnipresent in today's complex networks and repr...
This paper explores the performance of fitted neural Q iteration for
rei...
Potential-based reward shaping (PBRS) is an effective and popular techni...
Recent advances of gradient temporal-difference methods allow to learn
o...