research
∙
06/16/2023
Temporal Difference Learning with Experience Replay
Temporal-difference (TD) learning is widely regarded as one of the most ...
research
∙
02/20/2023
Backstepping Temporal Difference Learning
Off-policy learning ability is an important feature of reinforcement lea...
research
∙
07/25/2022
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
Q-learning has long been one of the most popular reinforcement learning ...
research
∙
02/11/2022
Regularized Q-learning
Q-learning is a widely used algorithm in the reinforcement learning community....
research
∙
09/09/2021