We consider regret minimization for Adversarial Markov Decision Processe...
It is well known that the worst-case minimax regret for sparse linear ba...
In this paper, we generalize the concept of heavy-tailed multi-armed ban...
We consider the Scale-Free Adversarial Multi-Armed Bandit (MAB) problem ...