Scientific Reports (Dec 2021)

Independent evaluation of 12 artificial intelligence solutions for the detection of tuberculosis

  • Andrew J. Codlin,
  • Thang Phuoc Dao,
  • Luan Nguyen Quang Vo,
  • Rachel J. Forse,
  • Vinh Van Truong,
  • Ha Minh Dang,
  • Lan Huu Nguyen,
  • Hoa Binh Nguyen,
  • Nhung Viet Nguyen,
  • Kristi Sidney-Annerstedt,
  • Bertie Squire,
  • Knut Lönnroth,
  • Maxine Caws

DOI
https://doi.org/10.1038/s41598-021-03265-0
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 11

Abstract

Read online

Abstract There have been few independent evaluations of computer-aided detection (CAD) software for tuberculosis (TB) screening, despite the rapidly expanding array of available CAD solutions. We developed a test library of chest X-ray (CXR) images which was blindly re-read by two TB clinicians with different levels of experience and then processed by 12 CAD software solutions. Using Xpert MTB/RIF results as the reference standard, we compared the performance characteristics of each CAD software against both an Expert and Intermediate Reader, using cut-off thresholds which were selected to match the sensitivity of each human reader. Six CAD systems performed on par with the Expert Reader (Qure.ai, DeepTek, Delft Imaging, JF Healthcare, OXIPIT, and Lunit) and one additional software (Infervision) performed on par with the Intermediate Reader only. Qure.ai, Delft Imaging and Lunit were the only software to perform significantly better than the Intermediate Reader. The majority of these CAD software showed significantly lower performance among participants with a past history of TB. The radiography equipment used to capture the CXR image was also shown to affect performance for some CAD software. TB program implementers now have a wide selection of quality CAD software solutions to utilize in their CXR screening initiatives.