Nature Communications (Jul 2024)

Integrating muti-omics data to identify tissue-specific DNA methylation biomarkers for cancer risk

  • Yaohua Yang,
  • Yaxin Chen,
  • Shuai Xu,
  • Xingyi Guo,
  • Guochong Jia,
  • Jie Ping,
  • Xiang Shu,
  • Tianying Zhao,
  • Fangcheng Yuan,
  • Gang Wang,
  • Yufang Xie,
  • Hang Ci,
  • Hongmo Liu,
  • Yawen Qi,
  • Yongjun Liu,
  • Dan Liu,
  • Weimin Li,
  • Fei Ye,
  • Xiao-Ou Shu,
  • Wei Zheng,
  • Li Li,
  • Qiuyin Cai,
  • Jirong Long

DOI
https://doi.org/10.1038/s41467-024-50404-y
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 13

Abstract

Read online

Abstract The relationship between tissue-specific DNA methylation and cancer risk remains inadequately elucidated. Leveraging resources from the Genotype-Tissue Expression consortium, here we develop genetic models to predict DNA methylation at CpG sites across the genome for seven tissues and apply these models to genome-wide association study data of corresponding cancers, namely breast, colorectal, renal cell, lung, ovarian, prostate, and testicular germ cell cancers. At Bonferroni-corrected P < 0.05, we identify 4248 CpGs that are significantly associated with cancer risk, of which 95.4% (4052) are specific to a particular cancer type. Notably, 92 CpGs within 55 putative novel loci retain significant associations with cancer risk after conditioning on proximal signals identified by genome-wide association studies. Integrative multi-omics analyses reveal 854 CpG-gene-cancer trios, suggesting that DNA methylation at 309 distinct CpGs might influence cancer risk through regulating the expression of 205 unique cis-genes. These findings substantially advance our understanding of the interplay between genetics, epigenetics, and gene expression in cancer etiology.