Cancer Cell International (Jul 2020)

DNA methylation data-based molecular subtype classification and prediction in patients with gastric cancer

  • Qixin Lian,
  • Bo Wang,
  • Lijun Fan,
  • Junqiang Sun,
  • Guilai Wang,
  • Jidong Zhang

DOI
https://doi.org/10.1186/s12935-020-01253-4
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Background Genetic and epigenetic alterations have been indicated to be closely correlated with the carcinogenesis, DNA methylation is one of most frequently occurring molecular behavior that take place early during this complicated process in gastric cancer (GC). Methods In this study, 398 samples were collected from the cancer genome atlas (TCGA) database and were analyzed, so as to mine the specific DNA methylation sites that affected the prognosis for GC patients. Moreover, the 23,588 selected CpGs that were markedly correlated with patient prognosis were used for consistent clustering of the samples into 6 subgroups, and samples in each subgroup varied in terms of M, Stage, Grade, and Age. In addition, the levels of methylation sites in each subgroup were calculated, and 347 methylation sites (corresponding to 271 genes) were screened as the intrasubgroup specific methylation sites. Meanwhile, genes in the corresponding promoter regions that the above specific methylation sites were located were performed signaling pathway enrichment analysis. Results The specific genes were enriched to the biological pathways that were reported to be closely correlated with GC; moreover, the subsequent transcription factor enrichment analysis discovered that, these genes were mainly enriched into the cell response to transcription factor B, regulation of MAPK signaling pathways, and regulation of cell proliferation and metastasis. Eventually, the prognosis prediction model for GC patients was constructed using the Random Forest Classifier model, and the training set and test set data were carried out independent verification and test. Conclusions Such specific classification based on specific DNA methylation sites can well reflect the heterogeneity of GC tissues, which contributes to developing the individualized treatment and accurately predicting patient prognosis.