Mammographic Breast Density Model Using Semi-Supervised Learning Reduces Inter-/Intra-Reader Variability

Alyssa T. Watanabe; Tara Retson; Junhao Wang; Richard Mantey; Chiyung Chim; Homa Karimabadi

doi:10.3390/diagnostics13162694

Diagnostics (Aug 2023)

Mammographic Breast Density Model Using Semi-Supervised Learning Reduces Inter-/Intra-Reader Variability

Alyssa T. Watanabe,
Tara Retson,
Junhao Wang,
Richard Mantey,
Chiyung Chim,
Homa Karimabadi

Affiliations

Alyssa T. Watanabe: Department of Radiology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90007, USA
Tara Retson: Department of Radiology, University of California, San Diego, CA 92093, USA
Junhao Wang: CureMetrix, Inc., San Diego, CA 92101, USA
Richard Mantey: CureMetrix, Inc., San Diego, CA 92101, USA
Chiyung Chim: CureMetrix, Inc., San Diego, CA 92101, USA
Homa Karimabadi: CureMetrix, Inc., San Diego, CA 92101, USA

DOI: https://doi.org/10.3390/diagnostics13162694
Journal volume & issue: Vol. 13, no. 16
p. 2694

Abstract

Read online

Breast density is an important risk factor for breast cancer development; however, imager inconsistency in density reporting can lead to patient and clinician confusion. A deep learning (DL) model for mammographic density grading was examined in a retrospective multi-reader multi-case study consisting of 928 image pairs and assessed for impact on inter- and intra-reader variability and reading time. Seven readers assigned density categories to the images, then re-read the test set aided by the model after a 4-week washout. To measure intra-reader agreement, 100 image pairs were blindly double read in both sessions. Linear Cohen Kappa (κ) and Student’s t-test were used to assess the model and reader performance. The model achieved a κ of 0.87 (95% CI: 0.84, 0.89) for four-class density assessment and a κ of 0.91 (95% CI: 0.88, 0.93) for binary non-dense/dense assessment. Superiority tests showed significant reduction in inter-reader variability (κ improved from 0.70 to 0.88, p ≤ 0.001) and intra-reader variability (κ improved from 0.83 to 0.95, p ≤ 0.01) for four-class density, and significant reduction in inter-reader variability (κ improved from 0.77 to 0.96, p ≤ 0.001) and intra-reader variability (κ improved from 0.89 to 0.97, p ≤ 0.01) for binary non-dense/dense assessment when aided by DL. The average reader mean reading time per image pair also decreased by 30%, 0.86 s (95% CI: 0.01, 1.71), with six of seven readers having reading time reductions.

Published in Diagnostics

ISSN: 2075-4418 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.mdpi.com/journal/diagnostics

About the journal

Abstract

Keywords