Reinforcement learning is widely used in applications where one needs to...
We consider the problem of tabular infinite horizon concave utility
rein...
Mean field control (MFC) is an effective way to mitigate the curse of
di...
We consider the problem of constrained Markov Decision Process (CMDP) wh...
Many engineering problems have multiple objectives, and the overall aim ...
We consider the problem where M agents interact with M identical and
ind...
We consider the problem where N agents collaboratively interact with an
...
Reinforcement learning typically assumes that the state update from the
...
We consider the bandit problem of selecting K out of N arms at each time...
Gradient descent and its variants are widely used in machine learning.
H...
Quantum key distribution (QKD) allows two distant parties to share encry...
Reinforcement Learning (RL) is being increasingly applied to optimize co...
Many real-world problems face the dilemma of choosing best K out of N
op...