This paper proposes an off-line algorithm, called Recurrent Model Predic...
This paper presents a constrained deep adaptive dynamic programming (CDA...
This paper proposes the Deep Generalized Policy Iteration (DGPI) algorit...
In this paper we analyze the stability of different coupling strategies ...