This work aims to study off-policy evaluation (OPE) under scenarios wher...
The Markov property is widely imposed in analysis of time series data.
C...
Many modern tech companies, such as Google, Uber, and Didi, utilize onli...
In real-world applications of reinforcement learning, it is often challe...
Mediation analysis learns the causal effect transmitted via mediator
var...
Dynamic treatment regimes assign personalized treatments to patients
seq...
Off-policy evaluation (OPE) is a method for estimating the return of a t...
Off-Policy evaluation (OPE) is concerned with evaluating a new target po...
Reinforcement learning (RL) is one of the most vibrant research frontier...
This paper studies reinforcement learning (RL) in doubly inhomogeneous
e...
In this article, we propose a novel pessimism-based Bayesian learning me...
We introduce super reinforcement learning in the batch setting, which ta...
We propose a sure screening approach for recovering the structure of a
t...
We study off-policy evaluation (OPE) for partially observable MDPs (POMD...
Off-policy evaluation is critical in a number of applications where new
...
We consider reinforcement learning (RL) methods in offline nonstationary...
We consider reinforcement learning (RL) methods in offline domains witho...
Policy evaluation based on A/B testing has attracted considerable intere...
This paper is concerned with constructing a confidence interval for a ta...
The two-sided markets such as ride-sharing companies often involve a gro...
An individualized decision rule (IDR) is a decision function that assign...
We consider off-policy evaluation (OPE) in Partially Observable Markov
D...
Tech companies (e.g., Google or Facebook) often use randomized online
ex...
In this article, we propose a new hypothesis testing method for directed...
Order dispatch is one of the central problems to ride-sharing platforms....
Off-policy evaluation learns a target policy's value with a historical
d...
We consider off-policy evaluation (OPE) in continuous action domains, su...
In this article, we consider the problem of high-dimensional conditional...
Mediation analysis is becoming an increasingly important tool in scienti...
The Markov assumption (MA) is fundamental to the empirical validity of
r...
A/B testing, or online experiment is a standard business strategy to com...
In order to identify important variables that are involved in making opt...