eLife (Jan 2022)

The Mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates

  • Lucie A Bergeron,
  • Søren Besenbacher,
  • Tychele Turner,
  • Cyril J Versoza,
  • Richard J Wang,
  • Alivia Lee Price,
  • Ellie Armstrong,
  • Meritxell Riera,
  • Jedidiah Carlson,
  • Hwei-yen Chen,
  • Matthew W Hahn,
  • Kelley Harris,
  • April Snøfrid Kleppe,
  • Elora H López-Nandam,
  • Priya Moorjani,
  • Susanne P Pfeifer,
  • George P Tiley,
  • Anne D Yoder,
  • Guojie Zhang,
  • Mikkel H Schierup

DOI
https://doi.org/10.7554/eLife.73577
Journal volume & issue
Vol. 11

Abstract

Read online

In the past decade, several studies have estimated the human per-generation germline mutation rate using large pedigrees. More recently, estimates for various nonhuman species have been published. However, methodological differences among studies in detecting germline mutations and estimating mutation rates make direct comparisons difficult. Here, we describe the many different steps involved in estimating pedigree-based mutation rates, including sampling, sequencing, mapping, variant calling, filtering, and appropriately accounting for false-positive and false-negative rates. For each step, we review the different methods and parameter choices that have been used in the recent literature. Additionally, we present the results from a ‘Mutationathon,’ a competition organized among five research labs to compare germline mutation rate estimates for a single pedigree of rhesus macaques. We report almost a twofold variation in the final estimated rate among groups using different post-alignment processing, calling, and filtering criteria, and provide details into the sources of variation across studies. Though the difference among estimates is not statistically significant, this discrepancy emphasizes the need for standardized methods in mutation rate estimations and the difficulty in comparing rates from different studies. Finally, this work aims to provide guidelines for computational and statistical benchmarks for future studies interested in identifying germline mutations from pedigrees.

Keywords