We study regret minimization in online episodic linear Markov Decision
P...
We study reinforcement learning with linear function approximation and
a...
An abundance of recent impossibility results establish that regret
minim...
We study to what extent stochastic gradient descent (SGD) may be underst...
We study online convex optimization in the random order model, recently
...
We study a variant of online convex optimization where the player is
per...