Predicting hospital admissions from individual patient data (IPD): an applied example to explore key elements driving external validity
Rafael Perera,
Marjan Van Den Akker,
Kym I E Snell,
Joerg J Meerpohl,
Ferdinand M Gerlach,
Paul Glasziou,
Daniela Küllenberg de Gaudry,
Ana Isabel González-González,
Jeanet Blom,
Christiane Muth,
Walter Emil Haefeli,
Karin M A Swart,
Petra Elders,
Henrik Rudolf,
Ulrich Thiem,
Andreas Daniel Meid,
Truc Sophia Dinh,
Donna Bosch-Lenders,
Hans J Trampisch,
Benno Flaig,
Ghainsom Kom
Affiliations
Rafael Perera
professor of medical statistics
Marjan Van Den Akker
Institute of General Practice, University of Frankfurt, Frankfurt am Main, Germany
Kym I E Snell
lecturer
Joerg J Meerpohl
Institute for Evidence in Medicine (for Cochrane Germany Foundation), Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
Ferdinand M Gerlach
1 Institute of General Practice, Goethe University Frankfurt, Frankfurt, Germany
Paul Glasziou
director
Daniela Küllenberg de Gaudry
researcher
Ana Isabel González-González
1 Institute of General Practice, Johann Wolfgang Goethe University, Frankfurt, Germany
Jeanet Blom
5 Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, Zuid-Holland, Netherlands
Christiane Muth
1 Institute of General Practice, Johann Wolfgang Goethe-University Frankfurt am Main, Frankfurt am Main, Hessen, Germany
Walter Emil Haefeli
Department of Clinical Pharmacology and Pharmacoepidemiology, University of Heidelberg, Heidelberg, Baden-Württemberg, Germany
Karin M A Swart
General Practice Medicine, Amsterdam UMC - Locatie VUmc, Vrije Universiteit, Amsterdam Public Health Research Institute, Amsterdam, Netherlands
Petra Elders
1 Department of General Practice & Elderly Care Medicine, Amsterdam Public Health Research Institute, VU University Medical Centre, Amsterdam, The Netherlands
Henrik Rudolf
Medical Informatics, Biometry and Epidemiology, Ruhr-University Bochum, Bochum, Germany
Ulrich Thiem
3Department of Geriatrics, Marienhospital Herne, University of Bochum, Herne, Germany
Andreas Daniel Meid
Department of Clinical Pharmacology & Pharmacoepidemiology, Heidelberg University, Heidelberg, Baden-Württemberg, Germany
Truc Sophia Dinh
Institute of General Practice, Goethe University, Frankfurt am Main, Hessen, Germany
Donna Bosch-Lenders
School of CAPHRI, Department of Family Medicine, Maastricht University, Maastricht, The Netherlands
Hans J Trampisch
Department of Medical Informatics, Biometry and Epidemiology, Ruhr University Bochum, Bochum, Nordrhein-Westfalen, Germany
Benno Flaig
Institute of General Practice, Goethe University, Frankfurt am Main, Hessen, Germany
Objective To explore factors that potentially impact external validation performance while developing and validating a prognostic model for hospital admissions (HAs) in complex older general practice patients.Study design and setting Using individual participant data from four cluster-randomised trials conducted in the Netherlands and Germany, we used logistic regression to develop a prognostic model to predict all-cause HAs within a 6-month follow-up period. A stratified intercept was used to account for heterogeneity in baseline risk between the studies. The model was validated both internally and by using internal-external cross-validation (IECV).Results Prior HAs, physical components of the health-related quality of life comorbidity index, and medication-related variables were used in the final model. While achieving moderate discriminatory performance, internal bootstrap validation revealed a pronounced risk of overfitting. The results of the IECV, in which calibration was highly variable even after accounting for between-study heterogeneity, agreed with this finding. Heterogeneity was equally reflected in differing baseline risk, predictor effects and absolute risk predictions.Conclusions Predictor effect heterogeneity and differing baseline risk can explain the limited external performance of HA prediction models. With such drivers known, model adjustments in external validation settings (eg, intercept recalibration, complete updating) can be applied more purposefully.Trial registration number PROSPERO id: CRD42018088129.