Multivariable prediction of functional outcome after first-episode psychosis: a crossover validation approach in EUFEST and PSYSCAN
Margot I. E. Slot,
Maria F. Urquijo Castro,
Inge Winter - van Rossum,
Hendrika H. van Hell,
Dominic Dwyer,
Paola Dazzan,
Arija Maat,
Lieuwe De Haan,
Benedicto Crespo-Facorro,
Birte Y. Glenthøj,
Stephen M. Lawrie,
Colm McDonald,
Oliver Gruber,
Thérèse van Amelsvoort,
Celso Arango,
Tilo Kircher,
Barnaby Nelson,
Silvana Galderisi,
Mark Weiser,
Gabriele Sachs,
Matthias Kirschner,
the PSYSCAN Consortium,
W. Wolfgang Fleischhacker,
Philip McGuire,
Nikolaos Koutsouleris,
René S. Kahn
Affiliations
Margot I. E. Slot
Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht
Maria F. Urquijo Castro
Department of Psychiatry and Psychotherapy, Ludwig-Maximilian-University
Inge Winter - van Rossum
Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht
Hendrika H. van Hell
Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht
Dominic Dwyer
Centre for Youth Mental Health, University of Melbourne
Paola Dazzan
Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, De Crespigny Park
Arija Maat
Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht
Lieuwe De Haan
Amsterdam UMC, University of Amsterdam, Psychiatry, Department Early Psychosis
Benedicto Crespo-Facorro
Department of Psychiatry, Marqués de Valdecilla University Hospital, IDIVAL. School of Medicine, University of Cantabria
Birte Y. Glenthøj
Centre for Neuropsychiatric Schizophrenia Research (CNSR) & Centre for Clinical Intervention and Neuropsychiatric Schizophrenia Research (CINS), Mental Health Centre Glostrup
Stephen M. Lawrie
Division of Psychiatry, University of Edinburgh, Royal Edinburgh Hospital
Colm McDonald
Centre for Neuroimaging & Cognitive Genomics (NICOG), NCBES Galway Neuroscience Centre, National University of Ireland Galway
Oliver Gruber
Section for Experimental Psychopathology and Neuroimaging, Department of General Psychiatry, Heidelberg University
Thérèse van Amelsvoort
Department of Psychiatry and Neuropsychology, Maastricht University
Celso Arango
Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, IiSGM, CIBERSAM, ISCIII, School of Medicine, Universidad Complutense
Tilo Kircher
Department of Psychiatry, University of Marburg, Rudolf-Bultmann-Straße 8
Barnaby Nelson
Centre for Youth Mental Health, University of Melbourne
Silvana Galderisi
Department of Mental and Physical Health and Preventive Medicine, University of Campania Luigi Vanvitelli, Largo Madonna delle Grazie
Mark Weiser
Zachai Department of Psychiatry, Sheba Medical Center
Gabriele Sachs
Department of Psychiatry and Psychotherapy
Matthias Kirschner
Division of Adult Psychiatry, Department of Psychiatry, University Hospitals of Geneva
the PSYSCAN Consortium
W. Wolfgang Fleischhacker
Medical University of Innsbruck
Philip McGuire
Department of Psychiatry, University of Oxford
Nikolaos Koutsouleris
Department of Psychiatry and Psychotherapy, Ludwig-Maximilian-University
René S. Kahn
Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht
Abstract Several multivariate prognostic models have been published to predict outcomes in patients with first episode psychosis (FEP), but it remains unclear whether those predictions generalize to independent populations. Using a subset of demographic and clinical baseline predictors, we aimed to develop and externally validate different models predicting functional outcome after a FEP in the context of a schizophrenia-spectrum disorder (FES), based on a previously published cross-validation and machine learning pipeline. A crossover validation approach was adopted in two large, international cohorts (EUFEST, n = 338, and the PSYSCAN FES cohort, n = 226). Scores on the Global Assessment of Functioning scale (GAF) at 12 month follow-up were dichotomized to differentiate between poor (GAF current < 65) and good outcome (GAF current ≥ 65). Pooled non-linear support vector machine (SVM) classifiers trained on the separate cohorts identified patients with a poor outcome with cross-validated balanced accuracies (BAC) of 65-66%, but BAC dropped substantially when the models were applied to patients from a different FES cohort (BAC = 50–56%). A leave-site-out analysis on the merged sample yielded better performance (BAC = 72%), highlighting the effect of combining data from different study designs to overcome calibration issues and improve model transportability. In conclusion, our results indicate that validation of prediction models in an independent sample is essential in assessing the true value of the model. Future external validation studies, as well as attempts to harmonize data collection across studies, are recommended.