Predicting pregnancy outcomes using longitudinal information: a penalized splines mixed-effects model approach

Thumbnail Image
Publication date
Defense date
Journal Title
Journal ISSN
Volume Title
John Wiley and Sons
Google Scholar
Research Projects
Organizational Units
Journal Issue
We propose a semiparametric nonlinear mixed-effects model (SNMM) using penalized splines to classify longitudinal data and improve the prediction of a binary outcome. The work is motivated by a study in which different hormone levels were measured during the early stages of pregnancy, and the challenge is using this information to predict normal versus abnormal pregnancy outcomes. The aim of this paper is to compare models and estimation strategies on the basis of alternative formulations of SNMMs depending on the characteristics of the data set under consideration. For our motivating example, we address the classification problem using a particular case of the SNMM in which the parameter space has a finite dimensional component (fixed effects and variance components) and an infinite dimensional component (unknown function) that need to be estimated. The nonparametric component of the model is estimated using penalized splines. For the parametric component, we compare the advantages of using random effects versus direct modeling of the correlation structure of the errors. Numerical studies show that our approach improves over other existing methods for the analysis of this type of data. Furthermore, the results obtained using our method support the idea that explicit modeling of the serial correlation of the error term improves the prediction accuracy with respect to a model with random effects, but independent errors.
Classification models, Correlated observations, Longitudinal data, Mixed-effects models, P-splines, Lasso-type estimators, Bayesian classification, Regression-analysis, Correlated errors, P-splines, HCG
Bibliographic citation
De la Cruz, R., Fuentes, C., Meza, C., Lee, D.-J., & Arribas-Gil, A. (2017). Predicting pregnancy outcomes using longitudinal information: a penalized splines mixed-effects model approach. Statistics in Medicine, 36(13), 2120–2134