Patterns (May 2020)

Patterns of Reliability: Assessing the Reproducibility and Integrity of DNA Methylation Measurement

  • Karen Sugden,
  • Eilis J. Hannon,
  • Louise Arseneault,
  • Daniel W. Belsky,
  • David L. Corcoran,
  • Helen L. Fisher,
  • Renate M. Houts,
  • Radhika Kandaswamy,
  • Terrie E. Moffitt,
  • Richie Poulton,
  • Joseph A. Prinz,
  • Line J.H. Rasmussen,
  • Benjamin S. Williams,
  • Chloe C.Y. Wong,
  • Jonathan Mill,
  • Avshalom Caspi

Journal volume & issue
Vol. 1, no. 2
p. 100014

Abstract

Read online

Summary: DNA methylation plays an important role in both normal human development and risk of disease. The most utilized method of assessing DNA methylation uses BeadChips, generating an epigenome-wide “snapshot” of >450,000 observations (probe measurements) per assay. However, the reliability of each of these measurements is not equal, and little consideration is paid to consequences for research. We correlated repeat measurements of the same DNA samples using the Illumina HumanMethylation450K and the Infinium MethylationEPIC BeadChips in 350 blood DNA samples. Probes that were reliably measured were more heritable and showed consistent associations with environmental exposures, gene expression, and greater cross-tissue concordance. Unreliable probes were less replicable and generated an unknown volume of false negatives. This serves as a lesson for working with DNA methylation data, but the lessons are equally applicable to working with other data: as we advance toward generating increasingly greater volumes of data, failure to document reliability risks harming reproducibility. The Bigger Picture: Although DNA methylation data are used widely by researchers in many fields, the reliability of these data are surprisingly variable. Our findings remind us that, in an age of increasingly big data, research is only as robust as its foundations. We hope that our findings will improve the integrity of DNA methylation studies. We also hope that our findings serve as a cautionary reminder for those generating and implementing big data of any type: reliability is a fundamental aspect of replicability. Conducting analysis with reliable data will improve chances of replicable findings, which might lead to more actionable targets for further research. To the extent that reliable data improve replicability, the knock-on effect will be more public confidence in research and less effort spent trying to replicate findings that are bound to fail.

Keywords