Epigenetics (Dec 2023)

Pruning and thresholding approach for methylation risk scores in multi-ancestry populations

  • Junyu Chen,
  • Evan Gatev,
  • Todd Everson,
  • Karen N. Conneely,
  • Nastassja Koen,
  • Michael P. Epstein,
  • Michael S. Kobor,
  • Heather J. Zar,
  • Dan J. Stein,
  • Anke Hüls

DOI
https://doi.org/10.1080/15592294.2023.2187172
Journal volume & issue
Vol. 18, no. 1

Abstract

Read online

Recent efforts have focused on developing methylation risk scores (MRS), a weighted sum of the individual’s DNA methylation (DNAm) values of pre-selected CpG sites. Most of the current MRS approaches that utilize Epigenome-wide association studies (EWAS) summary statistics only include genome-wide significant CpG sites and do not consider co-methylation. New methods that relax the p-value threshold to include more CpG sites and account for the inter-correlation of DNAm might improve the predictive performance of MRS. We paired informed co-methylation pruning with P-value thresholding to generate pruning and thresholding (P+T) MRS and evaluated its performance among multi-ancestry populations. Through simulation studies and real data analyses, we demonstrated that pruning provides an improvement over simple thresholding methods for prediction of phenotypes. We demonstrated that European-derived summary statistics can be used to develop P+T MRS among other populations such as African populations. However, the prediction accuracy of P+T MRS may differ across multi-ancestry population due to environmental/cultural/social differences.

Keywords