Frontiers in Genetics (Nov 2022)

Identification of subtypes of hepatocellular carcinoma and screening of prognostic molecular diagnostic markers based on cell adhesion molecule related genes

  • Ruge Sun,
  • Ruge Sun,
  • Yanchao Gao,
  • Fengjun Shen

DOI
https://doi.org/10.3389/fgene.2022.1042540
Journal volume & issue
Vol. 13

Abstract

Read online

Cell adhesion molecules can predict liver hepatocellular carcinoma (LIHC) metastasis and determine prognosis, while the mechanism of the role of cell adhesion molecules in LIHC needs to be further explored. LIHC-related expression data were sourced from The Cancer Genome Atlas (TCGA) and the gene expression omnibus (GEO) databases, and genes related to cell adhesion were sourced from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. First, the TCGA-LIHC dataset was clustered by the nonnegative matrix factorization (NMF) algorithm to find different subtypes of LIHC. Then the difference of prognosis and immune microenvironment between patients of different subtypes was evaluated. In addition, a prognostic risk model was obtained by least shrinkage and selection operator (LASSO) and Cox analysis, while a nomogram was drawn. Furthermore, functional enrichment analysis between high and low risk groups was conducted. Finally, the expressions of model genes were explored by quantitative real-time polymerase chain reaction (qRT-PCR). The 371 LIHC patients were classified into four subtypes by NMF clustering, and survival analysis revealed that disease-free survival (DFS) of these four subtypes were clearly different. Cancer-related pathways and immune microenvironment among these four subtypes were dysregulated. Moreover, 58 common differentially expressed genes (DEGs) between four subtypes were identified and were mainly associated with PPAR signaling pathway and amino acid metabolism. Furthermore, a prognostic model consisting of IGSF11, CD8A, ALCAM, CLDN6, JAM2, ITGB7, SDC3, CNTNAP1, and MPZ was built. A nomogram consisting of pathologic T and riskScore was built, and the calibration curve illustrated that the nomogram could better forecast LIHC prognosis. Gene Set Enrichment Analysis (GSEA) demonstrated that DEGs between high and low risk groups were mainly involved in cell cycle. Finally, the qRT-PCR illustrated the expressions of nine model genes between normal and LIHC tissue. A prognostic model consisting of IGSF11, CD8A, ALCAM, CLDN6, JAM2, ITGB7, SDC3, CNTNAP1, and MPZ was obtained, which provides an important reference for the molecular diagnosis of patient prognosis.

Keywords