Frontiers in Neuroscience (Dec 2021)

Robust, Primitive, and Unsupervised Quality Estimation for Segmentation Ensembles

  • Florian Kofler,
  • Florian Kofler,
  • Florian Kofler,
  • Ivan Ezhov,
  • Ivan Ezhov,
  • Lucas Fidon,
  • Carolin M. Pirkl,
  • Johannes C. Paetzold,
  • Johannes C. Paetzold,
  • Egon Burian,
  • Sarthak Pati,
  • Sarthak Pati,
  • Sarthak Pati,
  • Sarthak Pati,
  • Malek El Husseini,
  • Malek El Husseini,
  • Fernando Navarro,
  • Fernando Navarro,
  • Fernando Navarro,
  • Suprosanna Shit,
  • Suprosanna Shit,
  • Jan Kirschke,
  • Spyridon Bakas,
  • Spyridon Bakas,
  • Spyridon Bakas,
  • Claus Zimmer,
  • Benedikt Wiestler,
  • Bjoern H. Menze,
  • Bjoern H. Menze

DOI
https://doi.org/10.3389/fnins.2021.752780
Journal volume & issue
Vol. 15

Abstract

Read online

A multitude of image-based machine learning segmentation and classification algorithms has recently been proposed, offering diagnostic decision support for the identification and characterization of glioma, Covid-19 and many other diseases. Even though these algorithms often outperform human experts in segmentation tasks, their limited reliability, and in particular the inability to detect failure cases, has hindered translation into clinical practice. To address this major shortcoming, we propose an unsupervised quality estimation method for segmentation ensembles. Our primitive solution examines discord in binary segmentation maps to automatically flag segmentation results that are particularly error-prone and therefore require special assessment by human readers. We validate our method both on segmentation of brain glioma in multi-modal magnetic resonance - and of lung lesions in computer tomography images. Additionally, our method provides an adaptive prioritization mechanism to maximize efficacy in use of human expert time by enabling radiologists to focus on the most difficult, yet important cases while maintaining full diagnostic autonomy. Our method offers an intuitive and reliable uncertainty estimation from segmentation ensembles and thereby closes an important gap toward successful translation of automatic segmentation into clinical routine.

Keywords