Cancer Cell International (Sep 2020)

Identification and validation of methylation-driven genes prognostic signature for recurrence of laryngeal squamous cell carcinoma by integrated bioinformatics analysis

  • Jie Cui,
  • Liping Wang,
  • Waisheng Zhong,
  • Zhen Chen,
  • Jie Chen,
  • Hong Yang,
  • Genglong Liu

DOI
https://doi.org/10.1186/s12935-020-01567-3
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 19

Abstract

Read online

Abstract Background Recurrence remains a major obstacle to long-term survival of laryngeal squamous cell carcinoma (LSCC). We conducted a genome-wide integrated analysis of methylation and the transcriptome to establish methylation-driven genes prognostic signature (MDGPS) to precisely predict recurrence probability and optimize therapeutic strategies for LSCC. Methods LSCC DNA methylation datasets and RNA sequencing (RNA-seq) dataset were acquired from the Cancer Genome Atlas (TCGA). MethylMix was applied to detect DNA methylation-driven genes (MDGs). By univariate and multivariate Cox regression analyses, five genes of DNA MDGs was developed a recurrence-free survival (RFS)-related MDGPS. The predictive accuracy and clinical value of the MDGPS were evaluated by receiver operating characteristic (ROC) and decision curve analysis (DCA), and compared with TNM stage system. Additionally, prognostic value of MDGPS was validated by external Gene Expression Omnibus (GEO) database. According to 5 MDGs, the candidate small molecules for LSCC were screen out by the CMap database. To strengthen the bioinformatics analysis results, 30 pairs of clinical samples were evaluated by digoxigenin-labeled chromogenic in situ hybridization (CISH). Results A total of 88 DNA MDGs were identified, and five RFS-related MDGs (LINC01354, CCDC8, PHYHD1, MAGEB2 and ZNF732) were chosen to construct a MDGPS. The MDGPS can effectively divide patients into high-risk and low-risk group, with the area under curve (AUC) of 0.738 (5-year RFS) and AUC of 0.74 (3-year RFS). Stratification analysis affirmed that the MDGPS was still a significant statistical prognostic model in subsets of patients with different clinical variables. Multivariate Cox regression analysis indicated the efficacy of MDGPS appears independent of other clinicopathological characteristics. In terms of predictive capacity and clinical usefulness, the MDGPS was superior to traditional TNM stage. Additionally, the MDGPS was confirmed in external LSCC cohorts from GEO. CMap matched the 9 most significant small molecules as promising therapeutic drugs to reverse the LSCC gene expression. Finally, CISH analysis in 30 LSCC tissues and paired adjacent normal tissues revealed that MAGEB2 has significantly higher expression of LSCC compared to adjacent non-neoplastic tissues; LINC01354, CCDC8, PHYHD1, and ZNF732 have significantly lower expression of LSCC compared to adjacent non-neoplastic tissues, which were in line with bioinformatics analysis results. Conclusion A MDGPS, with five DNA MDGs, was identified and validated in LSCC patients by combining transcriptome and methylation datasets analysis. Compared TNM stage alone, it generates more accurate estimations of the recurrence prediction and maybe offer novel research directions and prospects for individualized treatment of patients with LSCC.

Keywords