npj Genomic Medicine (Feb 2024)

A cost-effective sequencing method for genetic studies combining high-depth whole exome and low-depth whole genome

  • Claude Bhérer,
  • Robert Eveleigh,
  • Katerina Trajanoska,
  • Janick St-Cyr,
  • Antoine Paccard,
  • Praveen Nadukkalam Ravindran,
  • Elizabeth Caron,
  • Nimara Bader Asbah,
  • Peyton McClelland,
  • Clare Wei,
  • Iris Baumgartner,
  • Marc Schindewolf,
  • Yvonne Döring,
  • Danielle Perley,
  • François Lefebvre,
  • Pierre Lepage,
  • Mathieu Bourgey,
  • Guillaume Bourque,
  • Jiannis Ragoussis,
  • Vincent Mooser,
  • Daniel Taliun

DOI
https://doi.org/10.1038/s41525-024-00390-3
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Whole genome sequencing (WGS) at high-depth (30X) allows the accurate discovery of variants in the coding and non-coding DNA regions and helps elucidate the genetic underpinnings of human health and diseases. Yet, due to the prohibitive cost of high-depth WGS, most large-scale genetic association studies use genotyping arrays or high-depth whole exome sequencing (WES). Here we propose a cost-effective method which we call “Whole Exome Genome Sequencing” (WEGS), that combines low-depth WGS and high-depth WES with up to 8 samples pooled and sequenced simultaneously (multiplexed). We experimentally assess the performance of WEGS with four different depth of coverage and sample multiplexing configurations. We show that the optimal WEGS configurations are 1.7–2.0 times cheaper than standard WES (no-plexing), 1.8–2.1 times cheaper than high-depth WGS, reach similar recall and precision rates in detecting coding variants as WES, and capture more population-specific variants in the rest of the genome that are difficult to recover when using genotype imputation methods. We apply WEGS to 862 patients with peripheral artery disease and show that it directly assesses more known disease-associated variants than a typical genotyping array and thousands of non-imputable variants per disease-associated locus.