We consider the problem of learning a function respecting a symmetry fro...
As noted in the works of <cit.>, it has been mentioned
that it is an ope...
We present a non-asymptotic lower bound on the eigenspectrum of the desi...
We consider an improper reinforcement learning setting where a learner i...
Q-learning and SARSA(0) with linear function approximation, under
ϵ-gree...
We consider the problem of sequentially learning to estimate, in the mea...
We revisit the method of mixture technique, also known as the Laplace me...
We consider minimisation of dynamic regret in non-stationary bandits wit...
We consider the following network model motivated, in particular, by
blo...
Detecting abrupt changes in temporal behavior patterns is of interest in...
We consider the problem of scheduling in constrained queueing networks w...
We consider an improper reinforcement learning setting where the learner...
We study a variant of the stochastic linear bandit problem wherein we
op...
We consider multi-objective optimization (MOO) of an unknown vector-valu...
We consider a multi-hypothesis testing problem involving a K-armed bandi...
We study the problem of best arm identification in linearly parameterise...
The number of confirmed cases of COVID-19 is often used as a proxy for t...
We consider the problem of stochastic K-armed dueling bandit in the
cont...
Motivated by medium access control for resource-challenged wireless Inte...
We consider the problem of PAC learning the most valuable item from a po...
The blockchain paradigm provides a mechanism for content dissemination a...
We consider the problem of adaptively PAC-learning a probability distrib...
We give a new algorithm for best arm identification in linearly paramete...
We develop algorithms with low regret for learning episodic Markov decis...
We present two algorithms for Bayesian optimization in the batch feedbac...
We study how to adapt to smoothly-varying (`easy') environments in well-...
We consider black box optimization of an unknown function in the
nonpara...
We consider PAC learning for identifying a good item from subset-wise sa...
We consider two regret minimisation problems over subsets of a finite gr...
We consider the problem of probably approximately correct (PAC) ranking ...
We introduce the probably approximately correct (PAC) version of the pro...
We consider online learning for minimizing regret in unknown, episodic M...
The problem of detecting an odd arm from a set of K arms of a multi-arme...
We consider a collaborative online learning paradigm, wherein a group of...
We consider reinforcement learning in parameterized Markov Decision Proc...