Complex modeling with detailed temporal predictors does not improve health records-based suicide risk prediction

Susan M. Shortreed; Rod L. Walker; Eric Johnson; Robert Wellman; Maricela Cruz; Rebecca Ziebell; R. Yates Coley; Zimri S. Yaseen; Sai Dharmarajan; Robert B. Penfold; Brian K. Ahmedani; Rebecca C. Rossom; Arne Beck; Jennifer M. Boggs; Greg E. Simon

doi:10.1038/s41746-023-00772-4

npj Digital Medicine (Mar 2023)

Affiliations

Susan M. Shortreed: Kaiser Permanente Washington Health Research Institute
Rod L. Walker: Kaiser Permanente Washington Health Research Institute
Eric Johnson: Kaiser Permanente Washington Health Research Institute
Robert Wellman: Kaiser Permanente Washington Health Research Institute
Maricela Cruz: Kaiser Permanente Washington Health Research Institute
Rebecca Ziebell: Kaiser Permanente Washington Health Research Institute
R. Yates Coley: Kaiser Permanente Washington Health Research Institute
Zimri S. Yaseen: U.S. Food and Drug Administration
Sai Dharmarajan: U.S. Food and Drug Administration
Robert B. Penfold: Kaiser Permanente Washington Health Research Institute
Brian K. Ahmedani: Center for Health Policy & Health Services Research, Henry Ford Health System
Rebecca C. Rossom: HealthPartners Institute, Division of Research
Arne Beck: Kaiser Permanente Colorado Institute for Health Research
Jennifer M. Boggs: Kaiser Permanente Colorado Institute for Health Research
Greg E. Simon: Kaiser Permanente Washington Health Research Institute

DOI: https://doi.org/10.1038/s41746-023-00772-4
Journal volume & issue: Vol. 6, no. 1, pp. 1–22

Abstract

Suicide risk prediction models can identify individuals for targeted intervention. Discussions of transparency, explainability, and transportability in machine learning presume that complex prediction models with many variables outperform simpler models. We compared random forest, artificial neural network, and ensemble models with 1500 temporally defined predictors to logistic regression models. Data from 25,800,888 mental health visits made by 3,081,420 individuals in 7 health systems were used to train and evaluate suicidal behavior prediction models. Model performance was compared across several measures. All models performed well (area under the receiver operating characteristic curve [AUC]: 0.794–0.858). Ensemble models performed best, but improvements over a regression model with 100 predictors were minimal (AUC improvements: 0.006–0.020). Results were consistent across performance metrics and subgroups defined by race, ethnicity, and sex. Our results suggest that simpler parametric models, which are easier to implement as part of routine clinical practice, perform comparably to more complex machine learning methods.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/