Patterns of Reliability: Assessing the Reproducibility and Integrity of DNA Methylation Measurement

Karen Sugden; Eilis J. Hannon; Louise Arseneault; Daniel W. Belsky; David L. Corcoran; Helen L. Fisher; Renate M. Houts; Radhika Kandaswamy; Terrie E. Moffitt; Richie Poulton; Joseph A. Prinz; Line J.H. Rasmussen; Benjamin S. Williams; Chloe C.Y. Wong; Jonathan Mill; Avshalom Caspi

doi:10.1016/j.patter.2020.100014

Patterns (May 2020)

Patterns of Reliability: Assessing the Reproducibility and Integrity of DNA Methylation Measurement

Karen Sugden,
Eilis J. Hannon,
Louise Arseneault,
Daniel W. Belsky,
David L. Corcoran,
Helen L. Fisher,
Renate M. Houts,
Radhika Kandaswamy,
Terrie E. Moffitt,
Richie Poulton,
Joseph A. Prinz,
Line J.H. Rasmussen,
Benjamin S. Williams,
Chloe C.Y. Wong,
Jonathan Mill,
Avshalom Caspi

Affiliations

Karen Sugden: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC, USA; Corresponding author
Eilis J. Hannon: Complex Disease Epigenetics Group, University of Exeter Medical School, Exeter, UK
Louise Arseneault: King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK
Daniel W. Belsky: Department of Epidemiology & Butler Aging Center, Columbia University Mailman School of Public Health, New York, NY, USA
David L. Corcoran: Center for Genomic and Computational Biology, Duke University, Durham, NC, USA
Helen L. Fisher: King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK
Renate M. Houts: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA
Radhika Kandaswamy: King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK
Terrie E. Moffitt: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC, USA; King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK; Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, USA
Richie Poulton: Dunedin Multidisciplinary Health and Development Research Unit, University of Otago, Dunedin, New Zealand
Joseph A. Prinz: Center for Genomic and Computational Biology, Duke University, Durham, NC, USA
Line J.H. Rasmussen: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA; Clinical Research Centre, Copenhagen University Hospital Amager and Hvidovre, Hvidovre, Denmark
Benjamin S. Williams: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC, USA
Chloe C.Y. Wong: King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK
Jonathan Mill: Complex Disease Epigenetics Group, University of Exeter Medical School, Exeter, UK
Avshalom Caspi: Department of Psychology and Neuroscience, Duke University, Grey Building, 2020 West Main Street, Suite 201, Durham, NC 27705, USA; Center for Genomic and Computational Biology, Duke University, Durham, NC, USA; King's College London, Social, Genetic, and Developmental Psychiatry Research Centre, Institute of Psychiatry, Psychology, and Neuroscience, London, UK; Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, USA

DOI: https://doi.org/10.1016/j.patter.2020.100014
Journal volume & issue: Vol. 1, no. 2
p. 100014

Abstract

Read online

Summary: DNA methylation plays an important role in both normal human development and risk of disease. The most utilized method of assessing DNA methylation uses BeadChips, generating an epigenome-wide “snapshot” of >450,000 observations (probe measurements) per assay. However, the reliability of each of these measurements is not equal, and little consideration is paid to consequences for research. We correlated repeat measurements of the same DNA samples using the Illumina HumanMethylation450K and the Infinium MethylationEPIC BeadChips in 350 blood DNA samples. Probes that were reliably measured were more heritable and showed consistent associations with environmental exposures, gene expression, and greater cross-tissue concordance. Unreliable probes were less replicable and generated an unknown volume of false negatives. This serves as a lesson for working with DNA methylation data, but the lessons are equally applicable to working with other data: as we advance toward generating increasingly greater volumes of data, failure to document reliability risks harming reproducibility. The Bigger Picture: Although DNA methylation data are used widely by researchers in many fields, the reliability of these data are surprisingly variable. Our findings remind us that, in an age of increasingly big data, research is only as robust as its foundations. We hope that our findings will improve the integrity of DNA methylation studies. We also hope that our findings serve as a cautionary reminder for those generating and implementing big data of any type: reliability is a fundamental aspect of replicability. Conducting analysis with reliable data will improve chances of replicable findings, which might lead to more actionable targets for further research. To the extent that reliable data improve replicability, the knock-on effect will be more public confidence in research and less effort spent trying to replicate findings that are bound to fail.

Published in Patterns

ISSN: 2666-3899 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.cell.com/patterns

About the journal

Abstract

Keywords