The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia

Nancy Rixe; Adam Frisch; Zhendong Wang; Judith M. Martin; Srinivasan Suresh; Srinivasan Suresh; Todd A. Florin; Sriram Ramgopal

doi:10.3389/fdgth.2023.1104604

Frontiers in Digital Health (Feb 2023)

The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia

Nancy Rixe,
Adam Frisch,
Zhendong Wang,
Judith M. Martin,
Srinivasan Suresh,
Srinivasan Suresh,
Todd A. Florin,
Sriram Ramgopal

Affiliations

Nancy Rixe: Division of Pediatric Emergency Medicine, UPMC Children’s Hospital of Pittsburgh, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
Adam Frisch: Department of Emergency Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
Zhendong Wang: School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, United States
Judith M. Martin: Division of General Academic Pediatrics, UPMC Children’s Hospital of Pittsburgh, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
Srinivasan Suresh: Division of Pediatric Emergency Medicine, UPMC Children’s Hospital of Pittsburgh, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
Srinivasan Suresh: Division of Health Informatics, UPMC Children’s Hospital of Pittsburgh, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
Todd A. Florin: Division of Emergency Medicine, Department of Pediatrics, Ann & Robert H. Lurie Children's Hospital of Chicago, Northwestern University Feinberg School of Medicine, Chicago, IL, United States
Sriram Ramgopal: Division of Emergency Medicine, Department of Pediatrics, Ann & Robert H. Lurie Children's Hospital of Chicago, Northwestern University Feinberg School of Medicine, Chicago, IL, United States

DOI: https://doi.org/10.3389/fdgth.2023.1104604
Journal volume & issue: Vol. 5

Abstract

Read online

ObjectiveChest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clinical decision support tools for this common pediatric infection. We sought to develop and internally validate NLP algorithms to identify pediatric chest radiograph (CXR) reports with pneumonia.Materials and methodsWe performed a retrospective study of encounters for patients from six pediatric hospitals over a 3-year period. We utilized six NLP techniques: word embedding, support vector machines, extreme gradient boosting (XGBoost), light gradient boosting machines Naïve Bayes and logistic regression. We evaluated their performance of each model from a validation sample of 1,350 chest radiographs developed as a stratified random sample of 35% admitted and 65% discharged patients when both using expert consensus and diagnosis codes.ResultsOf 172,662 encounters in the derivation sample, 15.6% had a discharge diagnosis of pneumonia in a primary or secondary position. The median patient age in the derivation sample was 3.7 years (interquartile range, 1.4–9.5 years). In the validation sample, 185/1350 (13.8%) and 205/1350 (15.3%) were classified as pneumonia by content experts and by diagnosis codes, respectively. Compared to content experts, Naïve Bayes had the highest sensitivity (93.5%) and XGBoost had the highest F1 score (72.4). Compared to a diagnosis code of pneumonia, the highest sensitivity was again with the Naïve Bayes (80.1%), and the highest F1 score was with the support vector machine (53.0%).ConclusionNLP algorithms can accurately identify pediatric pneumonia from radiography reports. Following external validation and implementation into the electronic health record, these algorithms can facilitate clinical decision support and inform large database research.

Published in Frontiers in Digital Health

ISSN: 2673-253X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Public aspects of medicine; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/digital-health#

About the journal

Abstract

Keywords