Functional response regression with funBART: an analysis of patient-specific stillbirth risk
This article introduces functional BART, a new approach for functional response regression--that is, estimating a functional mean response f(t) that depends upon a set of scalar covariates x. Functional BART, or funBART, is based on the successful Bayesian Additive Regression Trees (BART) model. The original BART model is an ensemble of regression trees; funBART extends this model to an ensemble of functional regression trees, in which the terminal nodes of each tree are parametrized by functions rather than scalar responses. Just like the original BART model, funBART offers an appealing combination of flexibility with user friendliness: it captures complex nonlinear relationships and interactions among the predictors, while eliminating many of the onerous "researcher degrees of freedom" involved in function-on-scalar regression using standard tools. In particular, functional BART does not require the user to specify a functional form or basis set for f(t), to manually choose interactions, or to use a multi-step approach to select model terms or basis coefficients. Our model replaces all of these choices by a single smoothing parameter, which can either be chosen to reflect prior knowledge or tuned in a data-dependent way. After demonstrating the performance of the method on a series of benchmarking experiments, we then apply funBART to our motivating example: pregnancy-outcomes data from the National Center for Health Statistics. Here, the goal is to build a model for stillbirth risk as a function of gestational age (t), based on maternal and fetal risk factors (x). We show how these patient-specific estimates of stillbirth risk can be used to inform clinical management of high-risk pregnancies, with the aim of reducing the risk of perinatal mortality. The R package funbart implements all methods described in this article, and supplementary materials are available online.
READ FULL TEXT