Ranking interfaces are everywhere in online platforms. There is thus an ...
Off-policy evaluation (OPE) attempts to predict the performance of count...
Off-policy evaluation (OPE) aims to accurately evaluate the performance ...
Dynamic treatment regimes (DTRs) are sequences of decision rules that re...
In real-world recommender systems and search engines, optimizing ranking...
Off-policy evaluation (OPE), or offline evaluation in general, evaluates...
Algorithms produce a growing portion of decisions and recommendations bo...
Countries with more democratic political regimes experienced greater GDP...
Many schools in large urban districts have more applicants than seats. C...
We build and publicize the Open Bandit Dataset and Pipeline to facilitat...
We develop a method for predicting the performance of reinforcement lear...
Many scientific experiments have an interest in the estimation of the av...
What is the most statistically efficient way to do off-policy evaluation...