Clinical Epigenetics (Nov 2024)
Technical variability across the 450K, EPICv1, and EPICv2 DNA methylation arrays: lessons learned for clinical and longitudinal studies
Abstract
Abstract DNA methylation (DNAm) is the most commonly measured epigenetic mechanism in human populations, with most studies using Illumina arrays to assess DNAm levels. In 2023, Illumina updated their DNAm arrays to the EPIC version 2 (EPICv2), building on prior iterations, namely the EPIC version 1 (EPICv1) and 450K arrays. Whether DNAm measurements are stable across these three generations of arrays has yet not been investigated, limiting the ability of researchers—especially those with longitudinal data—to compare and replicate results across arrays. Here, we present results from a study of 30 child participants (15 male; 15 female) from the Drakenstein Child Health Study, who had DNAm measured on all three of the latest arrays: 450K, EPICv1, and EPICv2. Using these data, we created an annotation of probe quality across arrays, which includes the intraclass correlations, interquartile ranges, correlations, and array bias (i.e., the extent to which DNAm levels were explained by array type) of all CpGs. We also present results from an analysis of sex differences, where we found that CpGs with lower replicability across arrays had higher array-based variance, suggesting this variance metric help guide replication efforts. We also showed that epigenetic age estimates across arrays were more stable when using the principal component versions of epigenetic clocks. Ultimately, this collection of results provides a framework for investigating the replicability and longitudinal stability of epigenetic changes across multiple versions of Illumina DNAm arrays.
Keywords