Labeled data are critical to modern machine learning applications, but o...
For infinite action contextual bandits, smoothed regret and reduction to...
Deep neural networks have great representation power, but typically requ...
Designing efficient general-purpose contextual bandit algorithms that wo...
A central problem in sequential decision making is to develop algorithms...
The goal of active learning is to achieve the same accuracy achievable b...
The model selection problem in the pure exploration linear bandit settin...
We study pure exploration in bandits, where the dimension of the feature...
We study a model selection problem in the linear bandit setting, where t...
We study the problem of Robust Outlier Arm Identification (ROAI), where ...
We study the regret minimization problem with the existence of multiple best...
Though deep neural networks have achieved huge success in recent studies and ...