In this paper, we consider an infinite horizon average reward Markov Dec...
We consider the problem of constrained Markov decision process (CMDP) in...
Reinforcement learning is widely used in applications where one needs to...
We consider the problem of tabular infinite horizon concave utility rein...
We consider the problem of constrained Markov Decision Process (CMDP) wh...
Many engineering problems have multiple objectives, and the overall aim ...
In the optimization of dynamical systems, the variables typically have c...
In the optimization of dynamic systems, the variables typically have con...
Gradient descent and its variants are widely used in machine learning. H...