We consider an improper reinforcement learning setting where a learner i...
We consider a system of several collocated nodes sharing a time-slotted
...
We consider an improper reinforcement learning setting where the learner...
Motivated by medium access control for resource-challenged wireless Inte...
We give a new algorithm for best arm identification in linearly paramete...