We propose a novel framework for analyzing the dynamics of distribution ...
We study two-player zero-sum stochastic games, and propose a form of
ind...
We initiate a principled study of algorithmic collective action on digit...
We propose an algorithm to solve a class of bi-level optimization proble...
We construct a zeroth-order gradient estimator for a smooth function def...
We study the efficiency of Thompson sampling for contextual bandits. Exi...
We study the problem of online learning in competitive settings in the
c...
As predictive models are deployed into the real world, they must increas...
Min-max optimization is emerging as a key framework for analyzing proble...
Distributionally robust supervised learning (DRSL) is emerging as a key
...
In this work we present a multi-armed bandit framework for online expert...
This paper proposes a framework for adaptively learning a feedback
linea...
Thompson sampling is a methodology for multi-armed bandit problems that ...
We prove that differential Nash equilibria are generic amongst local Nas...
We present a novel approach to control design for nonlinear systems, whi...
We show by counterexample that policy-gradient algorithms have no guaran...
Considering a class of gradient-based multi-agent learning algorithms in...
As learning algorithms are increasingly deployed in markets and other
co...
We address the problem of inverse reinforcement learning in Markov decis...