Natural Language Processing and Graph Theory: Making Sense of Imaging Records in a Novel Representation Frame

Laurent Binsfeld Gonçalves; Ivan Nesic; Marko Obradovic; Bram Stieltjes; Thomas Weikert; Jens Bremerich

doi:10.2196/40534

JMIR Medical Informatics (Dec 2022)

Natural Language Processing and Graph Theory: Making Sense of Imaging Records in a Novel Representation Frame

Laurent Binsfeld Gonçalves,
Ivan Nesic,
Marko Obradovic,
Bram Stieltjes,
Thomas Weikert,
Jens Bremerich

Affiliations

Laurent Binsfeld Gonçalves: ORCiD
Ivan Nesic: ORCiD
Marko Obradovic: ORCiD
Bram Stieltjes: ORCiD
Thomas Weikert: ORCiD
Jens Bremerich: ORCiD

DOI: https://doi.org/10.2196/40534
Journal volume & issue: Vol. 10, no. 12
p. e40534

Abstract

Read online

BackgroundA concise visualization framework of related reports would increase readability and improve patient management. To this end, temporal referrals to prior comparative exams are an essential connection to previous exams in written reports. Due to unstructured narrative texts' variable structure and content, their extraction is hampered by poor computer readability. Natural language processing (NLP) permits the extraction of structured information from unstructured texts automatically and can serve as an essential input for such a novel visualization framework. ObjectiveThis study proposes and evaluates an NLP-based algorithm capable of extracting the temporal referrals in written radiology reports, applies it to all the radiology reports generated for 10 years, introduces a graphical representation of imaging reports, and investigates its benefits for clinical and research purposes. MethodsIn this single-center, university hospital, retrospective study, we developed a convolutional neural network capable of extracting the date of referrals from imaging reports. The model's performance was assessed by calculating precision, recall, and F1-score using an independent test set of 149 reports. Next, the algorithm was applied to our department's radiology reports generated from 2011 to 2021. Finally, the reports and their metadata were represented in a modulable graph. ResultsFor extracting the date of referrals, the named-entity recognition (NER) model had a high precision of 0.93, a recall of 0.95, and an F1-score of 0.94. A total of 1,684,635 reports were included in the analysis. Temporal reference was mentioned in 53.3% (656,852/1,684,635), explicitly stated as not available in 21.0% (258,386/1,684,635), and omitted in 25.7% (317,059/1,684,635) of the reports. Imaging records can be visualized in a directed and modulable graph, in which the referring links represent the connecting arrows. ConclusionsAutomatically extracting the date of referrals from unstructured radiology reports using deep learning NLP algorithms is feasible. Graphs refined the selection of distinct pathology pathways, facilitated the revelation of missing comparisons, and enabled the query of specific referring exam sequences. Further work is needed to evaluate its benefits in clinics, research, and resource planning.

Published in JMIR Medical Informatics

ISSN: 2291-9694 (Online)
Publisher: JMIR Publications
Country of publisher: Canada
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://medinform.jmir.org

About the journal