BMC Genomics (Dec 2023)

Identification of hub genes based on integrated analysis of single-cell and microarray transcriptome in patients with pulmonary arterial hypertension

  • Yuhan Qin,
  • Gaoliang Yan,
  • Yong Qiao,
  • Dong Wang,
  • Chengchun Tang

DOI
https://doi.org/10.1186/s12864-023-09892-3
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background Pulmonary arterial hypertension (PAH) is a devastating chronic cardiopulmonary disease without an effective therapeutic approach. The underlying molecular mechanism of PAH remains largely unexplored at single-cell resolution. Methods Single-cell RNA sequencing (scRNA-seq) data from the Gene Expression Omnibus (GEO) database (GSE210248) was included and analyzed comprehensively. Additionally, microarray transcriptome data including 15 lung tissue from PAH patients and 11 normal samples (GSE113439) was also obtained. Seurat R package was applied to process scRNA-seq data. Uniform manifold approximation and projection (UMAP) was utilized for dimensionality reduction and cluster identification, and the SingleR package was performed for cell annotation. FindAllMarkers analysis and ClusterProfiler package were applied to identify differentially expressed genes (DEGs) for each cluster in GSE210248 and GSE113439, respectively. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genome (KEGG) were used for functional enrichment analysis of DEGs. Microenvironment Cell Populations counter (MCP counter) was applied to evaluate the immune cell infiltration. STRING was used to construct a protein-protein interaction (PPI) network of DEGs, followed by hub genes selection through Cytoscape software and Veen Diagram. Results Nineteen thousand five hundred seventy-six cells from 3 donors and 21,896 cells from 3 PAH patients remained for subsequent analysis after filtration. A total of 42 cell clusters were identified through UMAP and annotated by the SingleR package. 10 cell clusters with the top 10 cell amounts were selected for consequent analysis. Compared with the control group, the proportion of adipocytes and fibroblasts was significantly reduced, while CD8+ T cells and macrophages were notably increased in the PAH group. MCP counter revealed decreased distribution of CD8+ T cells, cytotoxic lymphocytes, and NK cells, as well as increased infiltration of monocytic lineage in PAH lung samples. Among 997 DEGs in GSE113439, module 1 with 68 critical genes was screened out through the MCODE plug-in in Cytoscape software. The top 20 DEGs in each cluster of GSE210248 were filtered out by the Cytohubba plug-in using the MCC method. Eventually, WDR43 and GNL2 were found significantly increased in PAH and identified as the hub genes after overlapping these DEGs from GSE210248 and GSE113439. Conclusion WDR43 and GNL2 might provide novel insight into revealing the new molecular mechanisms and potential therapeutic targets for PAH.

Keywords