Biomarker Research (Nov 2023)

Improving the prediction of Spreading Through Air Spaces (STAS) in primary lung cancer with a dynamic dual-delta hybrid machine learning model: a multicenter cohort study

  • Weiqiu Jin,
  • Leilei Shen,
  • Yu Tian,
  • Hongda Zhu,
  • Ningyuan Zou,
  • Mengwei Zhang,
  • Qian Chen,
  • Changzi Dong,
  • Qisheng Yang,
  • Long Jiang,
  • Jia Huang,
  • Zheng Yuan,
  • Xiaodan Ye,
  • Qingquan Luo

DOI
https://doi.org/10.1186/s40364-023-00539-9
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Background Reliable pre-surgical prediction of spreading through air spaces (STAS) in primary lung cancer is essential for precision treatment and surgical decision-making. We aimed to develop and validate a dual-delta deep-learning and radiomics model based on pretreatment computed tomography (CT) image series to predict the STAS in patients with lung cancer. Method Six hundred seventy-four patients with pre-surgery CT follow-up scans (with a minimum interval of two weeks) and primary lung cancer diagnosed by surgery were retrospectively recruited from three Chinese hospitals. The training cohort and internal validation cohort, comprising 509 and 76 patients respectively, were selected from Shanghai Chest Hospital; the external validation cohorts comprised 36 and 53 patients from two other centers, respectively. Four imaging signatures (classic radiomics features and deep learning [DL] features, delta-radiomics and delta-DL features) reflecting the STAS status were constructed from the pretreatment CT images by comprehensive methods including handcrafting, 3D views extraction, image registration and subtraction. A stepwise optimized three-step procedure, including feature extraction (by DL and time-base radiomics slope), feature selection (by reproducibility check and 45 selection algorithms), and classification (32 classifiers considered), was applied for signature building and methodology optimization. The interpretability of the proposed model was further assessed with Grad-CAM for DL-features and feature ranking for radiomics features. Results The dual-delta model showed satisfactory discrimination between STAS and non-STAS and yielded the areas under the receiver operating curve (AUCs) of 0.94 (95% CI, 0.92–0.96), 0.84 (95% CI, 0.82–0.86), and 0.84 (95% CI, 0.83–0.85) in the internal and two external validation cohorts, respectively, with interpretable core feature sets and feature maps. Conclusion The coupling of delta-DL model with delta-radiomics features enriches information such as anisotropy of tumor growth and heterogeneous changes within the tumor during the radiological follow-up, which could provide valuable information for STAS prediction in primary lung cancer.

Keywords