Policy evaluation in reinforcement learning is often conducted using
two...
The Mallows model, introduced in the seminal paper of Mallows 1957, is o...
Web crawling is the problem of keeping a cache of webpages fresh, i.e.,
...
We study the problem of efficient online multiclass linear classificatio...
We quantify the separation between the numbers of labeled examples requi...
TD(0) is one of the most commonly used algorithms in reinforcement learn...
We provide two distributed confidence ball algorithms for solving linear...