We present the first framework to solve linear inverse problems leveragi...
The study of collaborative multi-agent bandits has attracted significant...
The goal of contrasting learning is to learn a representation that prese...
This work considers the problem of finding a first-order stationary poin...
We provide a theoretical justification for sample recovery using diffusi...
Advances in computer vision and machine learning enable robots to percei...
One method for obtaining generalizable solutions to machine learning tas...
The stochastic multi-armed bandit setting has been recently studied in t...
Cascading bandits model the task of learning to rank K out of L items
ov...
We consider a multi-agent multi-armed bandit setting in which n honest
a...
We study convergence rates of AdaGrad-Norm as an exemplar of adaptive
st...
Recent empirical evidence has driven conventional wisdom to believe that...
For the misspecified linear Markov decision process (MLMDP) model of Jin...
We study a version of the contextual bandit problem where an agent is gi...
In temporal difference (TD) learning, off-policy sampling is known to be...
Pre-trained deep nets are commonly used to improve accuracies and traini...
In multi-server queueing systems where there is no central queue holding...
Recent work has considered natural variations of the multi-armed bandit
...
We propose two algorithms for episodic stochastic shortest path problems...
We propose to accelerate existing linear bandit algorithms to achieve
pe...
This paper develops an unified framework to study finite-sample converge...
We consider a variant of the traditional multi-armed bandit problem in w...
We study a variant of the stochastic linear bandit problem wherein we
op...
Model-Agnostic Meta-Learning (MAML) has demonstrated widespread success ...
In the regret-based formulation of multi-armed bandit (MAB) problems, ex...
There has been recent interest in collaborative multi-agent bandits, whe...
We study a multi-agent stochastic linear bandit with side information,
p...
We study a novel variant of the multi-armed bandit problem, where at eac...
We study a variant of the multi-armed bandit problem where side informat...
The Model-Agnostic Meta-Learning (MAML) algorithm <cit.> has
been celebr...
Stochastic Approximation (SA) is a popular approach for solving fixed po...
We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup
c...
We explore application of multi-armed bandit algorithms to statistical m...
In this paper, we introduce a distributed version of the classical stoch...
We consider a novel stochastic multi-armed bandit setting, where playing...
We consider a co-variate shift problem where one has access to several
m...
We study the problem of black-box optimization of a noisy function in th...
We consider the problem of discovering the simplest latent variable that...
Deep generative networks can simulate from a complex target distribution...
We introduce a new malware detector - Shape-GD - that aggregates per-mac...
We consider the problem of contextual bandits with stochastic experts, w...
We consider learning-based variants of the c μ rule for scheduling in
si...
We consider learning-based variants of the c μ rule -- a classic and
wel...
Emerging 5G systems will need to efficiently support both broadband traf...
We consider the problem of non-parametric Conditional Independence testi...
Motivated by applications in computational advertising and systems biolo...
We consider the task of learning the parameters of a single component
o...
Motivated by online recommendation and advertising systems, we consider ...
We propose a new yet natural algorithm for learning the graph structure ...