A systematic review of natural language processing applied to radiology reports

Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex

doi:10.1186/s12911-021-01533-7

BMC Medical Informatics and Decision Making (Jun 2021)

Affiliations

Arlene Casey: School of Literatures, Languages and Cultures (LLC), University of Edinburgh
Emma Davidson: Centre for Clinical Brain Sciences, University of Edinburgh
Michael Poon: Centre for Clinical Brain Sciences, University of Edinburgh
Hang Dong: Centre for Medical Informatics, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh
Daniel Duma: School of Literatures, Languages and Cultures (LLC), University of Edinburgh
Andreas Grivas: Institute for Language, Cognition and Computation, School of Informatics, University of Edinburgh
Claire Grover: Institute for Language, Cognition and Computation, School of Informatics, University of Edinburgh
Víctor Suárez-Paniagua: Centre for Medical Informatics, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh
Richard Tobin: Institute for Language, Cognition and Computation, School of Informatics, University of Edinburgh
William Whiteley: Centre for Clinical Brain Sciences, University of Edinburgh
Honghan Wu: Health Data Research UK
Beatrice Alex: School of Literatures, Languages and Cultures (LLC), University of Edinburgh

DOI: https://doi.org/10.1186/s12911-021-01533-7
Journal volume & issue: Vol. 21, no. 1
pp. 1 – 18

Abstract

Background: Natural language processing (NLP) has a significant role in advancing healthcare and has been found to be key in extracting structured information from radiology reports. Understanding recent developments in the application of NLP to radiology is important, but recent reviews on the topic are limited. This study systematically assesses and quantifies recent literature on NLP applied to radiology reports.

Methods: We conduct an automated literature search yielding 4836 results, using automated filtering, metadata enrichment steps and citation search combined with manual review. Our analysis is based on 21 variables including radiology characteristics, NLP methodology, performance, study, and clinical application characteristics.

Results: We present a comprehensive analysis of the 164 publications retrieved, with publications in 2019 almost triple those in 2015. Each publication is categorised into one of 6 clinical application categories. Deep learning use increases over the period, but conventional machine learning approaches are still prevalent. Deep learning remains challenged when data is scarce, and there is little evidence of adoption into clinical practice. Despite 17% of studies reporting F1 scores greater than 0.85, it is hard to compare these approaches because most of them use different datasets. Only 14 studies made their data available and 15 their code, with 10 externally validating their results.

Conclusions: Automated understanding of the clinical narratives in radiology reports has the potential to enhance the healthcare process, and we show that research in this field continues to grow. Reproducibility and explainability of models are important if the domain is to move applications into clinical use. More could be done to share code, enabling validation of methods on different institutional data, and to reduce heterogeneity in the reporting of study properties, allowing inter-study comparisons. Our results are significant for researchers in the field, providing a systematic synthesis of existing work to build on and helping to identify gaps and opportunities for collaboration and to avoid duplication.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
