PeerJ (Jul 2019)

A 3-mRNA-based prognostic signature of survival in oral squamous cell carcinoma

  • Ruoyan Cao,
  • Qiqi Wu,
  • Qiulan Li,
  • Mianfeng Yao,
  • Hongbo Zhou

DOI
https://doi.org/10.7717/peerj.7360
Journal volume & issue
Vol. 7
p. e7360

Abstract

Read online Read online

Background Oral squamous cell carcinoma (OSCC) is the most common type of head and neck squamous cell carcinoma with an unsatisfactory prognosis. The aim of this study was to identify potential prognostic mRNA biomarkers of OSCC based on analysis of The Cancer Genome Atlas (TCGA). Methods Expression profiles and clinical data of OSCC patients were collected from TCGA database. Univariate Cox analysis and the least absolute shrinkage and selection operator Cox (LASSO Cox) regression were used to primarily screen prognostic biomarkers. Then multivariate Cox analysis was performed to build a prognostic model based on the selected prognostic mRNAs. Nomograms were generated to predict the individual’s overall survival at 3 and 5 years. The model performance was assessed by the time-dependent receiver operating characteristic (ROC) curve and calibration plot in both training cohort and validation cohort (GSE41613 from NCBI GEO databases). In addition, machine learning was used to assess the importance of risk factors of OSCC. Finally, in order to explore the potential mechanisms of OSCC, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was completed. Results Three mRNAs (CLEC3B, C6 and CLCN1) were finally identified as a prognostic biomarker pattern. The risk score was imputed as: (−0.38602 × expression level of CLEC3B) + (−0.20632 × expression level of CLCN1) + (0.31541 × expression level of C6). In the TCGA training cohort, the area under the curve (AUC) was 0.705 and 0.711 for 3- and 5-year survival, respectively. In the validation cohort, AUC was 0.718 and 0.717 for 3- and 5-year survival. A satisfactory agreement between predictive values and observation values was demonstrated by the calibration curve in the probabilities of 3- and 5- year survival in both cohorts. Furthermore, machine learning identified the 3-mRNA signature as the most important risk factor to survival of OSCC. Neuroactive ligand-receptor interaction was most enriched mostly in KEGG pathway analysis. Conclusion A 3-mRNA signature (CLEC3B, C6 and CLCN1) successfully predicted the survival of OSCC patients in both training and test cohort. In addition, this signature was an independent and the most important risk factor of OSCC.

Keywords