Diagnostics (May 2025)

Leveraging Subjective Parameters and Biomarkers in Machine Learning Models: The Feasibility of <i>lnc-IL7R</i> for Managing Emphysema Progression

  • Tzu-Tao Chen,
  • Tzu-Yu Cheng,
  • I-Jung Liu,
  • Shu-Chuan Ho,
  • Kang-Yun Lee,
  • Huei-Tyng Huang,
  • Po-Hao Feng,
  • Kuan-Yuan Chen,
  • Ching-Shan Luo,
  • Chien-Hua Tseng,
  • Yueh-His Chen,
  • Arnab Majumdar,
  • Cheng-Yu Tsai,
  • Sheng-Ming Wu

DOI
https://doi.org/10.3390/diagnostics15091165
Journal volume & issue
Vol. 15, no. 9
p. 1165

Abstract

Read online

Background/Objectives: Chronic obstructive pulmonary disease (COPD) remains a leading cause of death worldwide, with emphysema progression providing valuable insights into disease development. Clinical assessment approaches, including pulmonary function tests and high-resolution computed tomography, are limited by accessibility constraints and radiation exposure. This study, therefore, proposed an alternative approach by integrating the novel biomarker long non-coding interleukin-7 receptor α-subunit gene (lnc-Il7R), along with other easily accessible clinical and biochemical metrics, into machine learning (ML) models. Methods: This cohort study collected baseline characteristics, COPD Assessment Test (CAT) scores, and biochemical details from the enrolled participants. Associations with emphysema severity, defined by a low attenuation area percentage (LAA%) threshold of 15%, were evaluated using simple and multivariate-adjusted models. The dataset was then split into training and validation (80%) and test (20%) subsets. Five ML models were employed, with the best-performing model being further analyzed for feature importance. Results: The majority of participants were elderly males. Compared to the LAA% lnc-Il7R (all p lnc-IL7R were strongly and negatively associated with LAA% (p lnc-IL7R fold changes as the strongest predictor for emphysema classification (LAA% ≥15%), followed by CAT scores and BMI. Conclusions: Machine learning models incorporated accessible clinical and biochemical markers, particularly the novel biomarker lnc-IL7R, achieving classification accuracy and AUROC exceeding 75% in emphysema assessments. These findings offer promising opportunities for improving emphysema classification and COPD management.

Keywords