Creating sparser prediction models of treatment outcome in depression: a proof-of-concept study using simultaneous feature selection and hyperparameter tuning

Nicolas Rost; Tanja M. Brückl; Nikolaos Koutsouleris; Elisabeth B. Binder; Bertram Müller-Myhsok

doi:10.1186/s12911-022-01926-2

BMC Medical Informatics and Decision Making (Jul 2022)

Affiliations

Nicolas Rost: Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry
Tanja M. Brückl: Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry
Nikolaos Koutsouleris: Department of Psychiatry and Psychotherapy, Ludwig Maximilian University
Elisabeth B. Binder: Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry
Bertram Müller-Myhsok: Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry

Journal volume & issue: Vol. 22, no. 1, pp. 1–13

Abstract

Background: Predicting treatment outcome in major depressive disorder (MDD) remains an essential challenge for precision psychiatry. Clinical prediction models (CPMs) based on supervised machine learning have been a promising approach for this endeavor. However, only a few CPMs have focused on model sparsity, even though sparser models might facilitate translation into clinical practice and lower the expenses of their application.

Methods: In this study, we developed a predictive modeling pipeline that combines hyperparameter tuning and recursive feature elimination in a nested cross-validation framework. Using three different classification algorithms, we applied this pipeline to a real-world clinical data set on MDD treatment response and to a second, simulated data set. Performance was evaluated by permutation testing and by comparison to a reference pipeline without nested feature selection.

Results: Across all models, the proposed pipeline led to sparser CPMs than the reference pipeline. In all but one comparison, the proposed pipeline yielded equally or more accurate predictions. For MDD treatment response, balanced accuracy scores ranged between 61% and 71% when models were applied to hold-out validation data.

Conclusions: The resulting models might be particularly interesting for clinical applications, as they could reduce expenses for clinical institutions and stress for patients.
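The general idea described under Methods, tuning hyperparameters and the number of retained features together inside a nested cross-validation, can be illustrated with a minimal scikit-learn sketch. This is not the authors' pipeline: the synthetic data, the logistic regression classifier, the grid values, and the fold counts are all assumptions chosen for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score

# Synthetic stand-in data (hypothetical; not the study's clinical data set).
X, y = make_classification(
    n_samples=200, n_features=12, n_informative=4, n_redundant=0, random_state=0
)

# RFE wraps the classifier, so a single grid can cover both the number of
# retained features and the classifier's regularization strength.
selector = RFE(LogisticRegression(max_iter=1000), step=1)
param_grid = {
    "n_features_to_select": [3, 6, 12],  # 12 keeps every feature
    "estimator__C": [0.1, 1.0],          # illustrative values only
}

# Inner loop: picks the best (feature count, C) combination.
inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=1)
# Outer loop: estimates performance on folds never seen during tuning.
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=2)

search = GridSearchCV(
    selector, param_grid, scoring="balanced_accuracy", cv=inner_cv
)

# Each outer fold reruns the full inner search on its own training part.
outer_scores = cross_val_score(
    search, X, y, scoring="balanced_accuracy", cv=outer_cv
)

# Refit on all data to inspect how sparse the selected model is.
search.fit(X, y)
n_selected = int(search.best_estimator_.support_.sum())

print("Outer-fold balanced accuracy:", outer_scores.round(2))
print("Features kept by the refit model:", n_selected, "of", X.shape[1])
```

Because feature elimination runs inside the inner folds, the outer-fold scores are not biased by feature selection on the test data. A reference pipeline without nested feature selection would correspond to fixing `n_features_to_select` at the full feature count.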

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
