Factors affecting the labelling accuracy of brain MRI studies relevant for deep learning abnormality detection

Matthew Benger; David A. Wood; Sina Kafiabadi; Aisha Al Busaidi; Emily Guilhem; Jeremy Lynch; Matthew Townend; Antanas Montvila; Juveria Siddiqui; Naveen Gadapa; Gareth Barker; Sebastian Ourselin; James H. Cole; James H. Cole; Thomas C. Booth; Thomas C. Booth

doi:10.3389/fradi.2023.1251825

Frontiers in Radiology (Nov 2023)

Factors affecting the labelling accuracy of brain MRI studies relevant for deep learning abnormality detection

Matthew Benger,
David A. Wood,
Sina Kafiabadi,
Aisha Al Busaidi,
Emily Guilhem,
Jeremy Lynch,
Matthew Townend,
Antanas Montvila,
Juveria Siddiqui,
Naveen Gadapa,
Gareth Barker,
Sebastian Ourselin,
James H. Cole,
James H. Cole,
Thomas C. Booth,
Thomas C. Booth

Affiliations

Matthew Benger: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
David A. Wood: School of Biomedical Engineering & Imaging Sciences, Kings College London, London, United Kingdom
Sina Kafiabadi: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Aisha Al Busaidi: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Emily Guilhem: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Jeremy Lynch: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Matthew Townend: School of Biomedical Engineering & Imaging Sciences, Kings College London, London, United Kingdom
Antanas Montvila: School of Biomedical Engineering & Imaging Sciences, Kings College London, London, United Kingdom
Juveria Siddiqui: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Naveen Gadapa: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Gareth Barker: Institute of Psychiatry, Psychology & Neuroscience, Kings College London, London, United Kingdom
Sebastian Ourselin: School of Biomedical Engineering & Imaging Sciences, Kings College London, London, United Kingdom
James H. Cole: Institute of Psychiatry, Psychology & Neuroscience, Kings College London, London, United Kingdom
James H. Cole: Centre for Medical Image Computing, Dementia Research, University College London, London, United Kingdom
Thomas C. Booth: Department of Neuroradiology, Kings College Hospital, London, United Kingdom
Thomas C. Booth: School of Biomedical Engineering & Imaging Sciences, Kings College London, London, United Kingdom

DOI: https://doi.org/10.3389/fradi.2023.1251825
Journal volume & issue: Vol. 3

Abstract

Read online

Unlocking the vast potential of deep learning-based computer vision classification systems necessitates large data sets for model training. Natural Language Processing (NLP)—involving automation of dataset labelling—represents a potential avenue to achieve this. However, many aspects of NLP for dataset labelling remain unvalidated. Expert radiologists manually labelled over 5,000 MRI head reports in order to develop a deep learning-based neuroradiology NLP report classifier. Our results demonstrate that binary labels (normal vs. abnormal) showed high rates of accuracy, even when only two MRI sequences (T2-weighted and those based on diffusion weighted imaging) were employed as opposed to all sequences in an examination. Meanwhile, the accuracy of more specific labelling for multiple disease categories was variable and dependent on the category. Finally, resultant model performance was shown to be dependent on the expertise of the original labeller, with worse performance seen with non-expert vs. expert labellers.

Published in Frontiers in Radiology

ISSN: 2673-8740 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General): Medical physics. Medical radiology. Nuclear medicine
Website: https://www.frontiersin.org/journals/radiology

About the journal

Abstract

Keywords