Cancers (Mar 2023)

Independent Validation of a Deep Learning nnU-Net Tool for Neuroblastoma Detection and Segmentation in MR Images

  • Diana Veiga-Canuto,
  • Leonor Cerdà-Alberich,
  • Ana Jiménez-Pastor,
  • José Miguel Carot Sierra,
  • Armando Gomis-Maya,
  • Cinta Sangüesa-Nebot,
  • Matías Fernández-Patón,
  • Blanca Martínez de las Heras,
  • Sabine Taschner-Mandl,
  • Vanessa Düster,
  • Ulrike Pötschger,
  • Thorsten Simon,
  • Emanuele Neri,
  • Ángel Alberich-Bayarri,
  • Adela Cañete,
  • Barbara Hero,
  • Ruth Ladenstein,
  • Luis Martí-Bonmatí

DOI
https://doi.org/10.3390/cancers15051622
Journal volume & issue
Vol. 15, no. 5
p. 1622

Abstract

Read online

Objectives. To externally validate and assess the accuracy of a previously trained fully automatic nnU-Net CNN algorithm to identify and segment primary neuroblastoma tumors in MR images in a large children cohort. Methods. An international multicenter, multivendor imaging repository of patients with neuroblastic tumors was used to validate the performance of a trained Machine Learning (ML) tool to identify and delineate primary neuroblastoma tumors. The dataset was heterogeneous and completely independent from the one used to train and tune the model, consisting of 300 children with neuroblastic tumors having 535 MR T2-weighted sequences (486 sequences at diagnosis and 49 after finalization of the first phase of chemotherapy). The automatic segmentation algorithm was based on a nnU-Net architecture developed within the PRIMAGE project. For comparison, the segmentation masks were manually edited by an expert radiologist, and the time for the manual editing was recorded. Different overlaps and spatial metrics were calculated to compare both masks. Results. The median Dice Similarity Coefficient (DSC) was high 0.997; 0.944–1.000 (median; Q1–Q3). In 18 MR sequences (6%), the net was not able neither to identify nor segment the tumor. No differences were found regarding the MR magnetic field, type of T2 sequence, or tumor location. No significant differences in the performance of the net were found in patients with an MR performed after chemotherapy. The time for visual inspection of the generated masks was 7.9 ± 7.5 (mean ± Standard Deviation (SD)) seconds. Those cases where manual editing was needed (136 masks) required 124 ± 120 s. Conclusions. The automatic CNN was able to locate and segment the primary tumor on the T2-weighted images in 94% of cases. There was an extremely high agreement between the automatic tool and the manually edited masks. This is the first study to validate an automatic segmentation model for neuroblastic tumor identification and segmentation with body MR images. The semi-automatic approach with minor manual editing of the deep learning segmentation increases the radiologist’s confidence in the solution with a minor workload for the radiologist.

Keywords