Journal of Big Data (Aug 2024)

Interpolation-split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance

  • Wing Keung Cheung,
  • Ashkan Pakzad,
  • Nesrin Mogulkoc,
  • Sarah Helen Needleman,
  • Bojidar Rangelov,
  • Eyjolfur Gudmundsson,
  • An Zhao,
  • Mariam Abbas,
  • Davina McLaverty,
  • Dimitrios Asimakopoulos,
  • Robert Chapman,
  • Recep Savas,
  • Sam M. Janes,
  • Yipeng Hu,
  • Daniel C. Alexander,
  • John R. Hurst,
  • Joseph Jacob

DOI
https://doi.org/10.1186/s40537-024-00974-x
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 30

Abstract

Read online

Abstract The morphology and distribution of airway tree abnormalities enable diagnosis and disease characterisation across a variety of chronic respiratory conditions. In this regard, airway segmentation plays a critical role in the production of the outline of the entire airway tree to enable estimation of disease extent and severity. Furthermore, the segmentation of a complete airway tree is challenging as the intensity, scale/size and shape of airway segments and their walls change across generations. The existing classical techniques either provide an undersegmented or oversegmented airway tree, and manual intervention is required for optimal airway tree segmentation. The recent development of deep learning methods provides a fully automatic way of segmenting airway trees; however, these methods usually require high GPU memory usage and are difficult to implement in low computational resource environments. Therefore, in this study, we propose a data-centric deep learning technique with big interpolated data, Interpolation-Split, to boost the segmentation performance of the airway tree. The proposed technique utilises interpolation and image split to improve data usefulness and quality. Then, an ensemble learning strategy is implemented to aggregate the segmented airway segments at different scales. In terms of average segmentation performance (dice similarity coefficient, DSC), our method (A) achieves 90.55%, 89.52%, and 85.80%; (B) outperforms the baseline models by 2.89%, 3.86%, and 3.87% on average; and (C) produces maximum segmentation performance gain by 14.11%, 9.28%, and 12.70% for individual cases when (1) nnU-Net with instant normalisation and leaky ReLU; (2) nnU-Net with batch normalisation and ReLU; and (3) modified dilated U-Net are used respectively. Our proposed method outperformed the state-of-the-art airway segmentation approaches. Furthermore, our proposed technique has low RAM and GPU memory usage, and it is GPU memory-efficient and highly flexible, enabling it to be deployed on any 2D deep learning model.