Leveraging Large-Scale Electronic Health Records and Interpretable Machine Learning for Clinical Decision Making at the Emergency Department: Protocol for System Development and Validation

Nan Liu; Feng Xie; Fahad Javaid Siddiqui; Andrew Fu Wah Ho; Bibhas Chakraborty; Gayathri Devi Nadarajan; Kenneth Boon Kiat Tan; Marcus Eng Hock Ong

doi:10.2196/34201

JMIR Research Protocols (Mar 2022)

Leveraging Large-Scale Electronic Health Records and Interpretable Machine Learning for Clinical Decision Making at the Emergency Department: Protocol for System Development and Validation

Nan Liu,
Feng Xie,
Fahad Javaid Siddiqui,
Andrew Fu Wah Ho,
Bibhas Chakraborty,
Gayathri Devi Nadarajan,
Kenneth Boon Kiat Tan,
Marcus Eng Hock Ong

Affiliations

Nan Liu: ORCiD
Feng Xie: ORCiD
Fahad Javaid Siddiqui: ORCiD
Andrew Fu Wah Ho: ORCiD
Bibhas Chakraborty: ORCiD
Gayathri Devi Nadarajan: ORCiD
Kenneth Boon Kiat Tan: ORCiD
Marcus Eng Hock Ong: ORCiD

DOI: https://doi.org/10.2196/34201
Journal volume & issue: Vol. 11, no. 3
p. e34201

Abstract

Read online

BackgroundThere is a growing demand globally for emergency department (ED) services. An increase in ED visits has resulted in overcrowding and longer waiting times. The triage process plays a crucial role in assessing and stratifying patients’ risks and ensuring that the critically ill promptly receive appropriate priority and emergency treatment. A substantial amount of research has been conducted on the use of machine learning tools to construct triage and risk prediction models; however, the black box nature of these models has limited their clinical application and interpretation. ObjectiveIn this study, we plan to develop an innovative, dynamic, and interpretable System for Emergency Risk Triage (SERT) for risk stratification in the ED by leveraging large-scale electronic health records (EHRs) and machine learning. MethodsTo achieve this objective, we will conduct a retrospective, single-center study based on a large, longitudinal data set obtained from the EHRs of the largest tertiary hospital in Singapore. Study outcomes include adverse events experienced by patients, such as the need for an intensive care unit and inpatient death. With preidentified candidate variables drawn from expert opinions and relevant literature, we will apply an interpretable machine learning–based AutoScore to develop 3 SERT scores. These 3 scores can be used at different times in the ED, that is, on arrival, during ED stay, and at admission. Furthermore, we will compare our novel SERT scores with established clinical scores and previously described black box machine learning models as baselines. Receiver operating characteristic analysis will be conducted on the testing cohorts for performance evaluation. ResultsThe study is currently being conducted. The extracted data indicate approximately 1.8 million ED visits by over 810,000 unique patients. Modelling results are expected to be published in 2022. ConclusionsThe SERT scoring system proposed in this study will be unique and innovative because of its dynamic nature and modelling transparency. If successfully validated, our proposed solution will establish a standard for data processing and modelling by taking advantage of large-scale EHRs and interpretable machine learning tools. International Registered Report Identifier (IRRID)DERR1-10.2196/34201

Published in JMIR Research Protocols

ISSN: 1929-0748 (Online)
Publisher: JMIR Publications
Country of publisher: Canada
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.researchprotocols.org

About the journal