Cancer Medicine (Jan 2020)

A network‐based predictive gene expression signature for recurrence risks in stage II colorectal cancer

  • Wen‐Jing Yang,
  • Hai‐Bo Wang,
  • Wen‐Da Wang,
  • Peng‐Yu Bai,
  • Hong‐Xia Lu,
  • Chang‐He Sun,
  • Zi‐Shen Liu,
  • Ding‐Kun Guan,
  • Guo‐Wang Yang,
  • Gan‐Lin Zhang

DOI
https://doi.org/10.1002/cam4.2642
Journal volume & issue
Vol. 9, no. 1
pp. 179 – 193

Abstract

Read online

Abstract The current criteria for defining the recurrence risks of stage II colorectal cancer (CRC) are not robust; therefore, we aimed to explore novel gene signatures to predict recurrence risks and to reveal the underlying mechanisms of stage II CRC. First, the gene expression profiles of 124 patients with stage II CRC from The Cancer Genome Atlas (TCGA) database were obtained to screen differentially expressed genes (DEGs). A total of 202 DEGs, including 128 upregulated and 74 downregulated, were identified in the recurrence group (n = 24) compared to the nonrecurrence group (n = 100). Furthermore, the top 5 DEGs (ZNF561, WFS1, SLC2A1, MFI2, and PTGR1) were identified by random forest variable hunting, and four (ZNF561, WFS1, SLC2A1, and PTGR1) were selected to create a four‐gene recurrent model (GRM), with an area under the curve (AUC) of 0.882 according to the receiver operating characteristic curve, and the robust diagnostic effectiveness of the GRM was further validated with another gene expression profiling dataset (GSE12032), with an AUC of 0.943. The diagnostic effectiveness of the GRM regarding recurrence was associated with poor disease‐free survival in all stages of CRC. In addition, gene ontology functional annotation and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses revealed 18 enriched functions and 6 enriched pathways. Four genes, ABCG2, CACNA1F, CYP19A1, and TF, were identified as hub genes by the protein‐protein interaction network, which further validated that these genes were correlated with a poor pathologic stage and overall survival in all stages of CRC. In conclusion, the GRM can effectively classify stage II CRC into groups of high and low risks of recurrence, thereby making up for the prognostic value of the traditional clinicopathological risk factors defined by the National Comprehensive Cancer Network guidelines. The hub genes may be useful therapeutic targets for recurrence. Thus, the GRM and hub genes could offer clinical value in directing individualized and precision therapeutic regimens for stage II CRC patients.

Keywords