Generalisable deep learning method for mammographic density prediction across imaging techniques and self-reported race

Galvin Khara; Hari Trivedi; Mary S. Newell; Ravi Patel; Tobias Rijken; Peter Kecskemethy; Ben Glocker

doi:10.1038/s43856-024-00446-6

Communications Medicine (Feb 2024)

Generalisable deep learning method for mammographic density prediction across imaging techniques and self-reported race

Galvin Khara,
Hari Trivedi,
Mary S. Newell,
Ravi Patel,
Tobias Rijken,
Peter Kecskemethy,
Ben Glocker

Affiliations

Galvin Khara: Kheiron Medical Technologies
Hari Trivedi: Winship Cancer Institute, Emory University
Mary S. Newell: Winship Cancer Institute, Emory University
Ravi Patel: Kheiron Medical Technologies
Tobias Rijken: Kheiron Medical Technologies
Peter Kecskemethy: Kheiron Medical Technologies
Ben Glocker: Kheiron Medical Technologies

DOI: https://doi.org/10.1038/s43856-024-00446-6
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Breast density is an important risk factor for breast cancer complemented by a higher risk of cancers being missed during screening of dense breasts due to reduced sensitivity of mammography. Automated, deep learning-based prediction of breast density could provide subject-specific risk assessment and flag difficult cases during screening. However, there is a lack of evidence for generalisability across imaging techniques and, importantly, across race. Methods This study used a large, racially diverse dataset with 69,697 mammographic studies comprising 451,642 individual images from 23,057 female participants. A deep learning model was developed for four-class BI-RADS density prediction. A comprehensive performance evaluation assessed the generalisability across two imaging techniques, full-field digital mammography (FFDM) and two-dimensional synthetic (2DS) mammography. A detailed subgroup performance and bias analysis assessed the generalisability across participants’ race. Results Here we show that a model trained on FFDM-only achieves a 4-class BI-RADS classification accuracy of 80.5% (79.7–81.4) on FFDM and 79.4% (78.5–80.2) on unseen 2DS data. When trained on both FFDM and 2DS images, the performance increases to 82.3% (81.4–83.0) and 82.3% (81.3–83.1). Racial subgroup analysis shows unbiased performance across Black, White, and Asian participants, despite a separate analysis confirming that race can be predicted from the images with a high accuracy of 86.7% (86.0–87.4). Conclusions Deep learning-based breast density prediction generalises across imaging techniques and race. No substantial disparities are found for any subgroup, including races that were never seen during model development, suggesting that density predictions are unbiased.

Published in Communications Medicine

ISSN: 2730-664X (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://www.nature.com/commsmed/

About the journal