In this paper, we present empirical studies on conversational recommenda...
We consider estimation of parameters defined as linear functionals of
so...
In this paper, we present JoinGym, an efficient and lightweight
query op...
While distributional reinforcement learning (RL) has demonstrated empiri...
In this paper, we investigate the problem of offline reinforcement learn...
Estimating heterogeneous treatment effects from observational data is a
...
In this paper, we study nonparametric estimation of instrumental variabl...
In this paper, we study risk-sensitive Reinforcement Learning (RL), focu...
In offline reinforcement learning (RL) we have no opportunity to explore...
In many applications of online decision making, the environment is
non-s...
Sequential testing, always-valid p-values, and confidence sequences prom...
Reinforcement learning (RL) is one of the most vibrant research frontier...
Epistemic uncertainty quantification is a crucial part of drawing credib...
Safety is a crucial necessity in many applications of reinforcement lear...
We study generic inference on identified linear functionals of nonunique...
We study off-policy evaluation (OPE) for partially observable MDPs (POMD...
We study representation learning for Offline Reinforcement Learning (RL)...
We study reinforcement learning with function approximation for large-sc...
We study Reinforcement Learning for partially observable dynamical syste...
The conditional average treatment effect (CATE) is the best point predic...
The fundamental problem of causal inference – that we never observe
coun...
In machine learning, disparity metrics are often defined by measuring th...
Off-policy evaluation and learning (OPE/L) use offline observational dat...
We study the identification and estimation of long-term treatment effect...
Since the average treatment effect (ATE) measures the change in social
w...
We study the problem of constructing bounds on the average treatment eff...
We conduct an empirical evaluation of the impact of New York's bail refo...
In applications of offline reinforcement learning to observational data,...
We study off-policy evaluation and learning from sequential data in a
st...
Exploration is a crucial aspect of bandit and reinforcement learning
alg...
We develop a new approach for identifying and estimating average causal
...
We study the problem of off-policy evaluation from batched contextual ba...
Empirical risk minimization (ERM) is the workhorse of machine learning,
...
Contextual bandit algorithms are increasingly replacing non-adaptive A/B...
We study the estimation of causal parameters when not all confounders ar...
We offer a theoretical characterization of off-policy evaluation (OPE) i...
We study the regret of reinforcement learning from offline data generate...
We study the interplay of fairness, welfare, and equity considerations i...
The conditional moment problem is a powerful formulation for describing
...
I provide a rejoinder for discussion of "More Efficient Policy Learning ...
Incorporating side observations of predictive features can help reduce
u...
We study off-policy evaluation (OPE) from multiple logging policies, eac...
We study conditional stochastic optimization problems, where we leverage...
Off-policy evaluation (OPE) in reinforcement learning is an important pr...
Offline reinforcement learning, wherein one uses off-policy data logged ...
We study the efficient off-policy evaluation of natural stochastic polic...
I study the minimax-optimal design for a two-arm controlled experiment w...
Dynamic treatment regimes (DTRs) for are personalized, sequential treatm...
I congratulate Profs. Binyan Jiang, Rui Song, Jialiang Li, and Donglin Z...
We study the problem of estimating treatment effects when the outcome of...