We study regret minimization in online episodic linear Markov Decision P...
We study diverse skill discovery in reward-free environments, aiming to ...
We present the OMG-CMDP! algorithm for regret minimization in adversaria...
We present the UC^3RL algorithm for regret minimization in Stochastic Co...
We consider the problem of controlling an unknown linear dynamical syste...
We consider the problem of controlling an unknown linear dynamical syste...
We consider stochastic optimization with delayed gradients where, at eac...
We study the Stochastic Shortest Path (SSP) problem in which an agent ha...
We study a novel variant of online finite-horizon Markov Decision Proces...
Stochastic shortest path (SSP) is a well-known problem in planning and c...
We consider the problem of learning in Linear Quadratic Control systems ...
We consider the applications of the Frank-Wolfe (FW) algorithm for Appre...
We derive and analyze learning algorithms for policy evaluation, apprent...
We present the first computationally-efficient algorithm with O(√(T)) r...
We study a classic algorithmic problem through the lens of statistical l...
In principal-agent models, a principal offers a contract to an agent to ...
We study the problem of controlling linear time-invariant systems with k...
In many practical uses of reinforcement learning (RL) the set of actions...
We study an online learning framework introduced by Mannor and Shamir (2...