Protocol for evaluating the fitness for purpose of an artificial intelligence product for radiology reporting in the BreastScreen New South Wales breast cancer screening programme

David Roder; Richard Walton; Chirag Mistry; Tracey A O’Brien; Matthew Warner-Smith; Kan Ren; Nalini Bhola; Sarah McGill

doi:10.1136/bmjopen-2023-082350

BMJ Open (May 2024)

Protocol for evaluating the fitness for purpose of an artificial intelligence product for radiology reporting in the BreastScreen New South Wales breast cancer screening programme

David Roder,
Richard Walton,
Chirag Mistry,
Tracey A O’Brien,
Matthew Warner-Smith,
Kan Ren,
Nalini Bhola,
Sarah McGill

Affiliations

David Roder: 2 Cancer Research Institute, University of South Australia, Adelaide, South Australia, Australia
Richard Walton: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Chirag Mistry: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Tracey A O’Brien: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Matthew Warner-Smith: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Kan Ren: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Nalini Bhola: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia
Sarah McGill: 1 Cancer Institute NSW, St Leonards, New South Wales, Australia

DOI: https://doi.org/10.1136/bmjopen-2023-082350
Journal volume & issue: Vol. 14, no. 5

Abstract

Read online

Introduction Radiologist shortages threaten the sustainability of breast cancer screening programmes. Artificial intelligence (AI) products that can interpret mammograms could mitigate this risk. While previous studies have suggested this technology has accuracy comparable to radiologists most have been limited by using ‘enriched’ datasets and/or not considering the interaction between the algorithm and human readers. This study will address these limitations by comparing the accuracy of a workflow using AI alongside radiologists on a large consecutive cohort of examinations from a breast cancer screening programme. The study will combine the strengths of a large retrospective design with the benefit of prospective data collection. It will test this technology without risk to screening programme participants nor the need to wait for follow-up data. With a sample of 2 years of consecutive screening examinations, it is likely the largest test of this technology to date. The study will help determine whether this technology can safely be introduced into the BreastScreen New South Wales (NSW) population-based screening programme to address radiology workforce risks without compromising cancer detection rates or increasing false-positive recalls.Methods and analysis A retrospective, consecutive cohort of digital mammography screens from 658 207 examinations from BreastScreen NSW will be reinterpreted by the Lunit Insight MMG AI product. The cohort includes 4383 screen-detected and 1171 interval cancers. The results will be compared with radiologist single reading and the AI results will also be used to replace the second reader in a double-reading model. New adjudication reading will be performed where the AI disagrees with the first reader. Recall rates and cancer detection rates of combined AI–radiologist reading will be compared with the rates obtained at the time of screening.Ethics and dissemination This study has ethical approval from the NSW Health Population Health Services Research Ethics Committee (2022/ETH02397). Findings will be published in peer-reviewed journals and presented at conferences. The findings of this evaluation will be provided to programme managers, governance bodies and other stakeholders in Australian breast cancer screening programmes.

Published in BMJ Open

ISSN: 2044-6055 (Online)
Publisher: BMJ Publishing Group
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://bmjopen.bmj.com

About the journal