PLoS ONE (Jan 2015)

Comparison of the Prognostic Utility of the Diverse Molecular Data among lncRNA, DNA Methylation, microRNA, and mRNA across Five Human Cancers.

  • Li Xu,
  • Liang Fengji,
  • Liu Changning,
  • Zhang Liangcai,
  • Li Yinghui,
  • Li Yu,
  • Chen Shanguang,
  • Xiong Jianghui

DOI
https://doi.org/10.1371/journal.pone.0142433
Journal volume & issue
Vol. 10, no. 11
p. e0142433

Abstract

Read online

Advances in high-throughput technologies have generated diverse informative molecular markers for cancer outcome prediction. Long non-coding RNA (lncRNA) and DNA methylation as new classes of promising markers are emerging as key molecules in human cancers; however, the prognostic utility of such diverse molecular data remains to be explored.We proposed a computational pipeline (IDFO) to predict patient survival by identifying prognosis-related biomarkers using multi-type molecular data (mRNA, microRNA, DNA methylation, and lncRNA) from 3198 samples of five cancer types. We assessed the predictive performance of both single molecular data and integrated multi-type molecular data in patient survival stratification, and compared their relative importance in each type of cancer, respectively. Survival analysis using multivariate Cox regression was performed to investigate the impact of the IDFO-identified markers and traditional variables on clinical outcome.Using the IDFO approach, we obtained good predictive performance of the molecular datasets (bootstrap accuracy: 0.71-0.97) in five cancer types. Impressively, lncRNA was identified as the best prognostic predictor in the validated cohorts of four cancer types, followed by DNA methylation, mRNA, and then microRNA. We found the incorporating of multi-type molecular data showed similar predictive power to single-type molecular data, but with the exception of the lncRNA + DNA methylation combinations in two cancers. Survival analysis of proportional hazard models confirmed a high robustness for lncRNA and DNA methylation as prognosis factors independent of traditional clinical variables.Our study provides insight into systematically understanding the prognostic performance of diverse molecular data in both single and aggregate patterns, which may have specific reference to subsequent related studies.