Repeatedly measured predictors: a comparison of methods for prediction modeling

Marieke Welten; Marlou L. A. de Kroon; Carry M. Renders; Ewout W. Steyerberg; Hein Raat; Jos W. R. Twisk; Martijn W. Heymans

doi:10.1186/s41512-018-0024-7

Diagnostic and Prognostic Research (Feb 2018)

Repeatedly measured predictors: a comparison of methods for prediction modeling

Marieke Welten,
Marlou L. A. de Kroon,
Carry M. Renders,
Ewout W. Steyerberg,
Hein Raat,
Jos W. R. Twisk,
Martijn W. Heymans

Affiliations

Marieke Welten: Department of Epidemiology and Biostatistics, Amsterdam Public Health Research Institute, VU Medical Center
Marlou L. A. de Kroon: Department of Public Health, Erasmus Medical Center
Carry M. Renders: Department of Health Sciences, Amsterdam Public Health Research Institute, VU University Amsterdam
Ewout W. Steyerberg: Department of Public Health, Erasmus Medical Center
Hein Raat: Department of Public Health, Erasmus Medical Center
Jos W. R. Twisk: Department of Epidemiology and Biostatistics, Amsterdam Public Health Research Institute, VU Medical Center
Martijn W. Heymans: Department of Epidemiology and Biostatistics, Amsterdam Public Health Research Institute, VU Medical Center

DOI: https://doi.org/10.1186/s41512-018-0024-7
Journal volume & issue: Vol. 2, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Background In literature, not much emphasis has been placed on methods for analyzing repeatedly measured independent variables, even less so for the use in prediction modeling specifically. However, repeated measurements could especially be interesting for the construction of prediction models. Therefore, our objective was to evaluate different methods to model a repeatedly measured independent variable and a long-term fixed outcome variable into a prediction model. Methods Six methods to handle a repeatedly measured predictor were applied to develop prediction models. Methods were evaluated with respect to the models’ predictive quality (explained variance R 2 and the area under the curve (AUC)) and their properties were discussed. The models included overweight and BMI-standard deviation score (BMI-SDS) at age 10 years as outcome and seven BMI-SDS measurements between 0 and 5.5 years as longitudinal predictor. Methods for comparison encompassed developing models including: all measurements; a single (here: the last) measurement; a mean or maximum value of all measurements; changes between subsequent measurements; conditional measurements; and growth curve parameters. Results All methods, except for using the maximum or mean, resulted in prediction models for overweight of similar predictive quality, with adjusted Nagelkerke R 2 ranging between 0.230 and 0.244 and AUC ranging between 0.799 and 0.807. Continuous BMI-SDS prediction showed similar results. Conclusions The choice of method depends on hypothesized predictor-outcome associations, available data, and requirements of the prediction model. Overall, the growth curve method seems to be the most flexible method capable of incorporating longitudinal predictor information without loss in predictive quality.

Published in Diagnostic and Prognostic Research

ISSN: 2397-7523 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: https://diagnprognres.biomedcentral.com/

About the journal

Abstract

Keywords