Biochemistry and Biophysics Reports (Jul 2024)

SH3GL2 and MMP17 as lung adenocarcinoma biomarkers: a machine-learning based approach

  • Zengjian Tian,
  • Shilong Yu,
  • Ruizhi Cai,
  • Yinghui Zhang,
  • Qilun Liu,
  • Yongzhao Zhu

Journal volume & issue
Vol. 38
p. 101693

Abstract

Read online

Objective: Using bioinformatics machine learning methods, our research aims to identify the potential key genes associated with Lung adenocarcinoma (LUAD). Methods: We obtained two gene expression profiling microarrays (GSE68571 and GSE74706) from the public Gene Expression Omnibus (GEO) database at the National Centre for Biotechnology Information (NCBI). The purpose was to identify Differentially Expressed Genes (DEGs) between the lung adenocarcinoma group and the healthy control group. The limma R package in R was utilized for this analysis. For the differential gene diagnosis of lung adenocarcinoma, we employed the least absolute shrinkage and selection operator (LASSO) regression and SVM-RFE screening crossover. To evaluate the performance, ROC curves were plotted. We performed immuno-infiltration analysis using CIBERSORT. Finally, we validated the key genes through qRT-PCR and Western-blot verification, then downregulated MMP17 gene expression, upregulated SH3GL2 gene expression, and performed CCK8 experiments. Results: A total of 32 Differentially Expressed Genes (DEGs) were identified. Two diagnostic marker genes, SH3GL2 and MMP17, were selected by employing LASSO and SVM-RFE machine learning methods. In Lung adenocarcinoma cells, the expression of MMP17 was observed to be elevated compared to normal lung epithelial cells in the control group (P < 0.05). In contrast, a down-regulation of SH3GL2 was found in Lung adenocarcinoma cells (P < 0.05). Finally, we downregulated MMP17 and upregulated SH3GL2 gene expression, then the CCK8 showed that the proliferation of both lung cancer cells was inhibited. Conclusion: SH3GL2 and MMP17 are expected to be potential biomarkers for Lung adenocarcinoma.

Keywords