This work aims to study off-policy evaluation (OPE) under scenarios wher...
In this paper, we propose a policy gradient method for confounded partia...
In real-world applications of reinforcement learning, it is often challe...
Pricing based on individual customer characteristics is widely used to
m...
We consider a class of assortment optimization problems in an offline
da...
Batch reinforcement learning (RL) aims at finding an optimal policy in a...
Motivated by the human-machine interaction such as training chatbots for...
This paper introduces RISE, a robust individualized decision learning
fr...
In this article, we propose a novel pessimism-based Bayesian learning me...
We introduce super reinforcement learning in the batch setting, which ta...
We study the problem of off-policy evaluation (OPE) for episodic Partial...
We study the offline reinforcement learning (RL) in the face of unmeasur...
We study the change point detection problem for high-dimensional linear
...
We study the off-policy evaluation (OPE) problem in an infinite-horizon
...
Deep Reinforcement Learning (DRL) has demonstrated great potentials in
s...
We thank the opportunity offered by editors for this discussion and the
...
Offline policy evaluation (OPE) is considered a fundamental and challeng...
Data-driven individualized decision making has recently received increas...
We study the sequential decision making problem in Markov decision proce...
We consider the batch (off-line) policy learning problem in the infinite...
Recent development in the data-driven decision science has seen great
ad...
This paper has two main goals: (a) establish several statistical
propert...
Recent exploration of optimal individualized decision rules (IDRs) for
p...
With the emergence of precision medicine, estimating optimal individuali...