International Journal of General Medicine (Apr 2025)

Single Cell Transcriptomics Genomics Based on Machine Learning Algorithm: Constructing and Validating Neutrophil Extracellular Trap Gene Model in COPD

  • Yu J,
  • Xiao T,
  • Pan Y,
  • He Y,
  • Tan J

Journal volume & issue
Vol. Volume 18
pp. 2247 – 2261

Abstract

Read online

Jia Yu,1,* Tiantian Xiao,1,* Yun Pan,2,* Yangshen He,1 Jiaxiong Tan3 1Department of Internal Medicine, Dongguan Hospital of Integrated Chinese and Western Medicine, Dongguan, Guangdong Province, People’s Republic of China; 2Department of Infectious Disease, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong Province, People’s Republic of China; 3Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin, People’s Republic of China*These authors contributed equally to this workCorrespondence: Yangshen He, Dongguan Hospital of Integrated Chinese and Western Medicine, Dongguan, Guangdong Province, 523000, People’s Republic of China, Email [email protected] Jiaxiong Tan, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin, 300202, People’s Republic of China, Email [email protected]: Neutrophil trap (NET) is an important feature of chronic inflammatory diseases. At present, there are still few studies to explore the characteristics of NET in different chronic obstructive pulmonary disease (COPD) patients. This study aimed to identify NET signature genes in different COPD patients.Methods: We analyzed single-cell RNA sequencing data from COPD and non-COPD individuals to identify differentially expressed neutrophil genes. Machine learning algorithms were applied to construct models A and B, specific to smoking and non-smoking COPD patients, respectively.Results: Through single-cell cluster analysis, 165 neutrophil characteristic genes in COPD group were successfully identified. Model A, consisting of key genes CD63, RNASE2, ERAP2, and model B, consisting of GRIPAP1, NHS, EGFLAM, and GLUL, were validated internally and externally, showing significant risk scores and good diagnostic efficacy (AUC: 60.24– 87.22). Alveolar lavage fluid in patients with COPD was studied and confirmed higher expression levels of RNASE2 and NHS in severe COPD patients.Conclusion: The study successfully developed NET signature gene models for identifying smoking and non-smoking COPD respectively, with validated specificity and predictive power, offering a foundation for personalized treatment strategies.Keywords: COPD, neutrophil extracellular traps, single-cell sequencing, transcriptomics

Keywords