Hard for humans, hard for machines: predicting readmission after psychiatric hospitalization using narrative notes

William Boag; Olga Kovaleva; Thomas H. McCoy; Anna Rumshisky; Peter Szolovits; Roy H. Perlis

doi:10.1038/s41398-020-01104-w

Translational Psychiatry (Jan 2021)

Hard for humans, hard for machines: predicting readmission after psychiatric hospitalization using narrative notes

William Boag,
Olga Kovaleva,
Thomas H. McCoy,
Anna Rumshisky,
Peter Szolovits,
Roy H. Perlis

Affiliations

William Boag: Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology
Olga Kovaleva: Department of Computer Science, University of Massachusetts Lowell
Thomas H. McCoy: Center for Quantitative Health, Division of Clinical Research, Massachusetts General Hospital
Anna Rumshisky: Department of Computer Science, University of Massachusetts Lowell
Peter Szolovits: Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology
Roy H. Perlis: Center for Quantitative Health, Division of Clinical Research, Massachusetts General Hospital

DOI: https://doi.org/10.1038/s41398-020-01104-w
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 6

Abstract

Read online

Abstract Machine learning has been suggested as a means of identifying individuals at greatest risk for hospital readmission, including psychiatric readmission. We sought to compare the performance of predictive models that use interpretable representations derived via topic modeling to the performance of human experts and nonexperts. We examined all 5076 admissions to a general psychiatry inpatient unit between 2009 and 2016 using electronic health records. We developed multiple models to predict 180-day readmission for these admissions based on features derived from narrative discharge summaries, augmented by baseline sociodemographic and clinical features. We developed models using a training set comprising 70% of the cohort and evaluated on the remaining 30%. Baseline models using demographic features for prediction achieved an area under the curve (AUC) of 0.675 [95% CI 0.674–0.676] on an independent testing set, while language-based models also incorporating bag-of-words features, discharge summaries topics identified by Latent Dirichlet allocation (LDA), and prior psychiatric admissions achieved AUC of 0.726 [95% CI 0.725–0.727]. To characterize the difficulty of the task, we also compared the performance of these classifiers to both expert and nonexpert human raters, with and without feedback, on a subset of 75 test cases. These models outperformed humans on average, including predictions by experienced psychiatrists. Typical note tokens or topics associated with readmission risk were related to pregnancy/postpartum state, family relationships, and psychosis.

Published in Translational Psychiatry

ISSN: 2158-3188 (Online)
Publisher: Nature Publishing Group
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.nature.com/tp/index.html

About the journal