Classical analysis of convex and non-convex optimization methods often
r...
We consider the problem of interactive decision making, encompassing
str...
In this paper, we study a linear bandit optimization problem in a federa...
A fundamental challenge in interactive learning and decision making, ran...
We study the problem of robust learning under clean-label data-poisoning...
Multi-armed bandits are widely applied in scenarios like recommender sys...
We study minimax optimal reinforcement learning in episodic factored Mar...
We investigate concentration inequalities for Dirichlet and Multinomial
...
Importance sampling (IS) is a common reweighting strategy for off-policy...
We introduce and analyse two algorithms for exploration-exploitation in
...