We study fully dynamic online selection problems in an adversarial/stoch...
We describe the current content moderation strategy employed by Meta to
...
In today's digital world, interaction with online platforms is ubiquitou...
We consider a multi-armed bandit setting where, at the beginning of each...
Moderating content in social media platforms is a formidable challenge d...
We study the problem of an online advertising system that wants to optim...
We consider a dynamic assortment selection problem where the goal is to ...
In this paper, we consider a novel variant of the multi-armed bandit (MA...
Recent advances in contextual bandit optimization and reinforcement lear...