In this paper, we study the problem of optimal data collection for polic...
In this paper, we consider the setting of piecewise i.i.d. bandits under...
This paper studies the problem of data collection for policy evaluation ...
The level set estimation problem seeks to find all points in a domain X ...
Active learning and structured stochastic bandit problems are intimately...
We consider the setup of stochastic multi-armed bandits in the case when...