The problem of matching markets has been studied for a long time in the ...
The linear bandit problem has been studied for many years in both stocha...
Learning Markov decision processes (MDPs) in an adversarial environment h...
The problem of online learning with graph feedback has been extensively ...
The problem of two-sided matching markets has a wide range of real-world...
Thompson sampling (TS) has attracted a lot of interest in the bandit are...
Online influence maximization (OIM) is a popular problem in social netwo...
Due to its great importance in deep natural language understanding and v...