BMC Cardiovascular Disorders (Jul 2023)

Identification of crucial genes related to heart failure based on GEO database

  • Yongliang Chen,
  • Jing Xue,
  • Xiaoli Yan,
  • Da-guang Fang,
  • Fangliang Li,
  • Xuefei Tian,
  • Peng Yan,
  • Zengbin Feng

DOI
https://doi.org/10.1186/s12872-023-03400-x
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background The molecular biological mechanisms underlying heart failure (HF) remain poorly understood. Therefore, it is imperative to use innovative approaches, such as high-throughput sequencing and artificial intelligence, to investigate the pathogenesis, diagnosis, and potential treatment of HF. Methods First, we initially screened Two data sets (GSE3586 and GSE5406) from the GEO database containing HF and control samples from the GEO database to establish the Train group, and selected another dataset (GSE57345) to construct the Test group for verification. Next, we identified the genes with significantly different expression levels in patients with or without HF and performed functional and pathway enrichment analyses. HF-specific genes were identified, and an artificial neural network was constructed by Random Forest. The ROC curve was used to evaluate the accuracy and reliability of the constructed model in the Train and Test groups. Finally, immune cell infiltration was analyzed to determine the role of the inflammatory response and the immunological microenvironment in the pathogenesis of HF. Results In the Train group, 153 significant differentially expressed genes (DEGs) associated with HF were found to be abnormal, including 81 down-regulated genes and 72 up-regulated genes. GO and KEGG enrichment analyses revealed that the down-regulated genes were primarily enriched in organic anion transport, neutrophil activation, and the PI3K-Akt signaling pathway. The upregulated genes were mainly enriched in neutrophil activation and the calcium signaling. DEGs were identified using Random Forest, and finally, 16 HF-specific genes were obtained. In the ROC validation and evaluation, the area under the curve (AUC) of the Train and Test groups were 0.996 and 0.863, respectively. Conclusions Our research revealed the potential functions and pathways implicated in the progression of HF, and designed an RNA diagnostic model for HF tissues using machine learning and artificial neural networks. Sensitivity, specificity, and stability were confirmed by ROC curves in the two different cohorts.

Keywords