Novel metrics for growth model selection

Matthew R. Grigsby; Junrui Di; Andrew Leroux; Vadim Zipunnikov; Luo Xiao; Ciprian Crainiceanu; William Checkley

doi:10.1186/s12982-018-0072-z

Emerging Themes in Epidemiology (Feb 2018)

Affiliations

Matthew R. Grigsby: Division of Pulmonary and Critical Care, School of Medicine, Johns Hopkins University
Junrui Di: Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
Andrew Leroux: Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
Vadim Zipunnikov: Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
Luo Xiao: Department of Statistics, North Carolina State University
Ciprian Crainiceanu: Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
William Checkley: Division of Pulmonary and Critical Care, School of Medicine, Johns Hopkins University

DOI: https://doi.org/10.1186/s12982-018-0072-z
Journal volume & issue: Vol. 15, no. 1
pp. 1 – 10

Abstract

Background: The literature on statistical modeling of childhood growth data offers a diverse set of models from which investigators can choose. However, the lack of a comprehensive framework for comparing non-nested models makes it difficult to assess model performance. This paper proposes a framework for comparing non-nested growth models using novel metrics of predictive accuracy based on modifications of the mean squared error criterion.

Methods: Three metrics were created: normalized, age-adjusted, and weighted mean squared error (MSE). These predictive performance metrics were used to compare linear mixed effects models and functional regression models. Prediction accuracy was assessed by partitioning the observed data into training and test datasets. The partitioning was constructed to assess prediction accuracy in four scenarios: backward (i.e., early growth), forward (i.e., late growth), in-range, and new-individual prediction. Analyses used height measurements from 215 Peruvian children, with data spanning from near birth to 2 years of age.

Results: Functional models outperformed linear mixed effects models in all scenarios tested. In particular, prediction errors for functional concurrent regression (FCR) and functional principal component analysis models were approximately 6% lower than those for linear mixed effects models. When subject-specific MSEs were weighted according to subject-specific growth rates during infancy, FCR was the best performer in all scenarios.

Conclusion: With this novel approach, we can quantitatively compare non-nested models and weight subgroups of interest to select the best-performing growth model for a particular application or problem at hand.
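The abstract does not give formulas for its metrics, so the following Python sketch is only an illustration of the general idea of weighting subject-specific MSEs. It assumes the weighted MSE is a weighted average of per-subject MSEs computed on held-out test observations; the function names, the record layout, and the weights are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def subject_mse(records):
    """Per-subject mean squared error.

    records: iterable of (subject_id, observed, predicted) tuples taken
    from a held-out test set (hypothetical layout, not from the paper).
    """
    sq_err_sum = defaultdict(float)
    n_obs = defaultdict(int)
    for subject_id, observed, predicted in records:
        sq_err_sum[subject_id] += (observed - predicted) ** 2
        n_obs[subject_id] += 1
    return {s: sq_err_sum[s] / n_obs[s] for s in sq_err_sum}


def weighted_mse(per_subject, weights):
    """Weighted average of subject-specific MSEs.

    weights: dict mapping subject_id to a non-negative weight, for
    example one derived from that subject's growth rate (assumption).
    """
    total_weight = sum(weights[s] for s in per_subject)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(weights[s] * per_subject[s] for s in per_subject) / total_weight


# Toy heights in cm, invented purely for illustration.
test_records = [
    ("a", 50.0, 51.0),
    ("a", 60.0, 58.0),
    ("b", 55.0, 55.0),
]
per_subject = subject_mse(test_records)
overall = weighted_mse(per_subject, {"a": 1.0, "b": 3.0})
```

Under this assumed definition, giving larger weights to a subgroup makes its prediction errors count more in the overall score, which is the kind of subgroup weighting the Conclusion refers to.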

Published in Emerging Themes in Epidemiology

ISSN: 1742-7622 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Infectious and parasitic diseases
Website: http://ete-online.biomedcentral.com
