Many model-based reinforcement learning (RL) algorithms can be viewed as...
A common technique in reinforcement learning is to evaluate the value
fu...
We provide performance guarantees for a variant of simulation-based poli...
When the sizes of the state and action spaces are large, solving MDPs ca...
We consider Markov Decision Processes (MDPs) in which every stationary p...
We propose a distributed algorithm to compute an equilibrium in aggregat...