Thoracic Cancer (Jun 2021)

Using a risk model for probability of cancer in pulmonary nodules

  • Si‐Qi Liu,
  • Xiao‐Bin Ma,
  • Wan‐Mei Song,
  • Yi‐Fan Li,
  • Ning Li,
  • Li‐Na Wang,
  • Jin‐Yue Liu,
  • Ning‐Ning Tao,
  • Shi‐Jin Li,
  • Ting‐Ting Xu,
  • Qian‐Yun Zhang,
  • Qi‐Qi An,
  • Bin Liang,
  • Huai‐Chen Li

DOI
https://doi.org/10.1111/1759-7714.13991
Journal volume & issue
Vol. 12, no. 12
pp. 1881 – 1889

Abstract

Read online

Abstract Background Considering the high morbidity and mortality of lung cancer and the high incidence of pulmonary nodules, clearly distinguishing benign from malignant lung nodules at an early stage is of great significance. However, determining the kind of lung nodule which is more prone to lung cancer remains a problem worldwide. Methods A total of 480 patients with pulmonary nodule data were collected from Shandong, China. We assessed the clinical characteristics and computed tomography (CT) imaging features among pulmonary nodules in patients who had undergone video‐assisted thoracoscopic surgery (VATS) lobectomy from 2013 to 2018. Preliminary selection of features was based on a statistical analysis using SPSS. We used WEKA to assess the machine learning models using its multiple algorithms and selected the best decision tree model using its optimization algorithm. Results The combination of decision tree and logistics regression optimized the decision tree without affecting its AUC. The decision tree structure showed that lobulation was the most important feature, followed by spiculation, vessel convergence sign, nodule type, satellite nodule, nodule size and age of patient. Conclusions Our study shows that decision tree analyses can be applied to screen individuals for early lung cancer with CT. Our decision tree provides a new way to help clinicians establish a logical diagnosis by a stepwise progression method, but still needs to be validated for prospective trials in a larger patient population.

Keywords