Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing

Christina M. Lineback; Ravi Garg; Elissa Oh; Andrew M. Naidech; Andrew M. Naidech; Jane L. Holl; Shyam Prabhakaran

doi:10.3389/fneur.2021.649521

Frontiers in Neurology (Jul 2021)

Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing

Christina M. Lineback,
Ravi Garg,
Elissa Oh,
Andrew M. Naidech,
Andrew M. Naidech,
Jane L. Holl,
Shyam Prabhakaran

Affiliations

Christina M. Lineback: Department of Neurology, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
Ravi Garg: Department of Neurology, Biological Sciences, Division and Center for Healthcare Delivery Science and Innovation, University of Chicago, Chicago, IL, United States
Elissa Oh: Department of Neurology, Biological Sciences, Division and Center for Healthcare Delivery Science and Innovation, University of Chicago, Chicago, IL, United States
Andrew M. Naidech: Department of Neurology, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
Andrew M. Naidech: Department of Neurology, Biological Sciences, Division and Center for Healthcare Delivery Science and Innovation, University of Chicago, Chicago, IL, United States
Jane L. Holl: Department of Neurology, Biological Sciences, Division and Center for Healthcare Delivery Science and Innovation, University of Chicago, Chicago, IL, United States
Shyam Prabhakaran: Department of Neurology, University of Chicago, Chicago, IL, United States

DOI: https://doi.org/10.3389/fneur.2021.649521
Journal volume & issue: Vol. 12

Abstract

Read online

Background and Purpose: This study aims to determine whether machine learning (ML) and natural language processing (NLP) from electronic health records (EHR) improve the prediction of 30-day readmission after stroke.Methods: Among index stroke admissions between 2011 and 2016 at an academic medical center, we abstracted discrete data from the EHR on demographics, risk factors, medications, hospital complications, and discharge destination and unstructured textual data from clinician notes. Readmission was defined as any unplanned hospital admission within 30 days of discharge. We developed models to predict two separate outcomes, as follows: (1) 30-day all-cause readmission and (2) 30-day stroke readmission. We compared the performance of logistic regression with advanced ML algorithms. We used several NLP methods to generate additional features from unstructured textual reports. We evaluated the performance of prediction models using a five-fold validation and tested the best model in a held-out test dataset. Areas under the curve (AUCs) were used to compare discrimination of each model.Results: In a held-out test dataset, advanced ML methods along with NLP features out performed logistic regression for all-cause readmission (AUC, 0.64 vs. 0.58; p < 0.001) and stroke readmission prediction (AUC, 0.62 vs. 0.52; p < 0.001).Conclusion: NLP-enhanced machine learning models potentially advance our ability to predict readmission after stroke. However, further improvement is necessary before being implemented in clinical practice given the weak discrimination.

Published in Frontiers in Neurology

ISSN: 1664-2295 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry: Neurology. Diseases of the nervous system
Website: https://www.frontiersin.org/journals/neurology/

About the journal

Abstract

Keywords