Enhancing Annotation Efficiency with Machine Learning: Automated Partitioning of a Lung Ultrasound Dataset by View

Bennett VanBerlo; Delaney Smith; Jared Tschirhart; Blake VanBerlo; Derek Wu; Alex Ford; Joseph McCauley; Benjamin Wu; Rushil Chaudhary; Chintan Dave; Jordan Ho; Jason Deglint; Brian Li; Robert Arntfield

doi:10.3390/diagnostics12102351

Diagnostics (Sep 2022)

Affiliations

Bennett VanBerlo: Faculty of Engineering, University of Western Ontario, London, ON N6A 5C1, Canada
Delaney Smith: Faculty of Mathematics, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Jared Tschirhart: Schulich School of Medicine and Dentistry, Western University, London, ON N6A 5C1, Canada
Blake VanBerlo: Faculty of Mathematics, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Derek Wu: Department of Medicine, Western University, London, ON N6A 5C1, Canada
Alex Ford: Lawson Health Research Institute, London, ON N6C 2R5, Canada
Joseph McCauley: Faculty of Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Benjamin Wu: Lawson Health Research Institute, London, ON N6C 2R5, Canada
Rushil Chaudhary: Department of Medicine, Western University, London, ON N6A 5C1, Canada
Chintan Dave: Division of Critical Care Medicine, Western University, London, ON N6A 5C1, Canada
Jordan Ho: Department of Family Medicine, Western University, London, ON N6A 5C1, Canada
Jason Deglint: Faculty of Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Brian Li: Faculty of Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
Robert Arntfield: Division of Critical Care Medicine, Western University, London, ON N6A 5C1, Canada

DOI: https://doi.org/10.3390/diagnostics12102351
Journal volume & issue: Vol. 12, no. 10, p. 2351

Abstract

Background: Annotating large medical imaging datasets is an arduous and expensive task, especially when the datasets in question are not organized according to deep learning goals. Here, we propose a method that exploits the hierarchical organization of annotation tasks to optimize efficiency. Methods: We trained a machine learning model to accurately distinguish between two classes of lung ultrasound (LUS) views using 2908 clips from a larger dataset. Partitioning the remaining dataset by view would reduce downstream labelling effort by enabling annotators to focus on the pathological features specific to each view. Results: In a sample view-specific annotation task, we found that automatically partitioning a 780-clip dataset by view saved 42 min of manual annotation time and resulted in 55 ± 6 additional relevant labels per hour. Conclusions: Automatic partitioning of a LUS dataset by view significantly increases annotator efficiency, resulting in higher throughput relevant to the annotation task at hand. The strategy described in this work can be applied to other hierarchical annotation schemes.
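
The partitioning step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `partition_by_view`, the placeholder labels `view_a` and `view_b`, the 0.5 threshold, and the clip IDs are all hypothetical, and the probabilities stand in for the output of some binary view classifier.

```python
from collections import defaultdict


def partition_by_view(predictions, threshold=0.5):
    """Group clip IDs into per-view annotation queues.

    predictions: dict mapping a clip ID to the (hypothetical) classifier's
    probability that the clip belongs to "view_b".
    threshold: assumed decision threshold, not taken from the paper.
    """
    partitions = defaultdict(list)
    for clip_id, prob_b in predictions.items():
        # Route each clip to the queue of its predicted view.
        view = "view_b" if prob_b >= threshold else "view_a"
        partitions[view].append(clip_id)
    return dict(partitions)


# Made-up example probabilities for three clips.
example = {"clip_001": 0.92, "clip_002": 0.08, "clip_003": 0.61}
queues = partition_by_view(example)
```

Each resulting queue could then be handed to annotators working on the features specific to that view, so that they do not have to sort clips by view manually first.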

Published in Diagnostics

ISSN: 2075-4418 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.mdpi.com/journal/diagnostics