Transfer reinforcement learning aims to improve the sample efficiency of...
Upper Confidence Bound (UCB) is arguably the most commonly used method f...
We study contextual multi-armed bandit problems in the case of multiple
...
We provide a theoretical analysis of the representation learning problem...
Refrigeration and chiller optimization is an important and well studied ...
In this work, we study recommendation systems modelled as contextual
mul...