Trust-region methods based on Kullback-Leibler divergence are pervasivel...
In this paper, we present a Distributionally Robust Markov Decision Proc...
Demand response (DR) has been demonstrated to be an effective method for...
In this paper, we propose methods for functional predictor selection and...
Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization...
Mobile and ubiquitous sensing of urban air quality (AQ) has received
inc...