ARDSFlag: an NLP/machine learning algorithm to visualize and detect high-probability ARDS admissions independent of provider recognition and billing codes

Amir Gandomi; Phil Wu; Daniel R Clement; Jinyan Xing; Rachel Aviv; Matthew Federbush; Zhiyong Yuan; Yajun Jing; Guangyao Wei; Negin Hajizadeh

doi:10.1186/s12911-024-02573-5

BMC Medical Informatics and Decision Making (Jul 2024)

ARDSFlag: an NLP/machine learning algorithm to visualize and detect high-probability ARDS admissions independent of provider recognition and billing codes

Amir Gandomi,
Phil Wu,
Daniel R Clement,
Jinyan Xing,
Rachel Aviv,
Matthew Federbush,
Zhiyong Yuan,
Yajun Jing,
Guangyao Wei,
Negin Hajizadeh

Affiliations

Amir Gandomi: Frank G. Zarb School of Business, Hofstra University
Phil Wu: AiD Technologies
Daniel R Clement: Donald and Barbara Zucker School of Medicine at Hofstra/Northwell
Jinyan Xing: Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University
Rachel Aviv: Kaiser Permanente
Matthew Federbush: Donald and Barbara Zucker School of Medicine at Hofstra/Northwell
Zhiyong Yuan: Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University
Yajun Jing: Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University
Guangyao Wei: Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University
Negin Hajizadeh: Institute of Health System Science, Feinstein Institute for Medical Research

DOI: https://doi.org/10.1186/s12911-024-02573-5
Journal volume & issue: Vol. 24, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Despite the significance and prevalence of acute respiratory distress syndrome (ARDS), its detection remains highly variable and inconsistent. In this work, we aim to develop an algorithm (ARDSFlag) to automate the diagnosis of ARDS based on the Berlin definition. We also aim to develop a visualization tool that helps clinicians efficiently assess ARDS criteria. Methods ARDSFlag applies machine learning (ML) and natural language processing (NLP) techniques to evaluate Berlin criteria by incorporating structured and unstructured data in an electronic health record (EHR) system. The study cohort includes 19,534 ICU admissions in the Medical Information Mart for Intensive Care III (MIMIC-III) database. The output is the ARDS diagnosis, onset time, and severity. Results ARDSFlag includes separate text classifiers trained using large training sets to find evidence of bilateral infiltrates in radiology reports (accuracy of 91.9%±0.5%) and heart failure/fluid overload in radiology reports (accuracy 86.1%±0.5%) and echocardiogram notes (accuracy 98.4%±0.3%). A test set of 300 cases, which was blindly and independently labeled for ARDS by two groups of clinicians, shows that ARDSFlag generates an overall accuracy of 89.0% (specificity = 91.7%, recall = 80.3%, and precision = 75.0%) in detecting ARDS cases. Conclusion To our best knowledge, this is the first study to focus on developing a method to automate the detection of ARDS. Some studies have developed and used other methods to answer other research questions. Expectedly, ARDSFlag generates a significantly higher performance in all accuracy measures compared to those methods.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com

About the journal

Abstract

Keywords