In this paper, we introduce a generalization of the standard Stackelberg...
Offline reinforcement learning (RL), which refers to decision-making fro...
In this paper, we study oracle-efficient algorithms for beyond worst-cas...
The First-Come First-Served (FCFS) scheduling policy is the most popular...
This paper presents the first non-asymptotic result showing that a model...