We study the problem of bounding path-dependent expectations (within any...
We propose a new unbiased estimator for estimating the utility of the op...
We study the sequential batch learning problem in linear contextual band...
This paper proposes a novel non-parametric multidimensional convex regre...
We study the distributed simulation problem where n users aim to generat...
In this paper, we study the multi-armed bandit problem in the batched se...