research
∙
01/18/2022
AdaTerm: Adaptive T-Distribution Estimated Robust Moments towards Noise-Robust Stochastic Gradient Optimizer
As the problems to be optimized with deep learning become more practical...
research
∙
08/02/2021
Adaptive t-Momentum-based Optimization for Unknown Ratio of Outliers in Amateur Data in Imitation Learning
Behavioral cloning (BC) bears a high potential for safe and direct trans...
research
∙
08/25/2020
t-Soft Update of Target Network for Deep Reinforcement Learning
This paper proposes a new robust update rule of the target network for d...
research
∙
02/29/2020