Scientific Reports (Aug 2024)

A program for real-time surveillance of SARS-CoV-2 genetics

  • Hayden N. Brochu,
  • Kuncheng Song,
  • Qimin Zhang,
  • Qiandong Zeng,
  • Adib Shafi,
  • Matthew Robinson,
  • Jake Humphrey,
  • Bobbi Croy,
  • Lydia Peavy,
  • Minoli Perera,
  • Scott Parker,
  • John Pruitt,
  • Jason Munroe,
  • Rama Ghatti,
  • Thomas J. Urban,
  • Ayla B. Harris,
  • David Alfego,
  • Brian Norvell,
  • Michael Levandoski,
  • Brian Krueger,
  • Jonathan D. Williams,
  • Deborah Boles,
  • Melinda B. Nye,
  • Suzanne E. Dale,
  • Michael Sapeta,
  • Christos J. Petropoulos,
  • Jonathan Meltzer,
  • Marcia Eisenberg,
  • Oren Cohen,
  • Stanley Letovsky,
  • Lakshmanan K. Iyer

DOI
https://doi.org/10.1038/s41598-024-70697-9
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract The COVID-19 pandemic brought forth an urgent need for widespread genomic surveillance for rapid detection and monitoring of emerging SARS-CoV-2 variants. It necessitated design, development, and deployment of a nationwide infrastructure designed for sequestration, consolidation, and characterization of patient samples that disseminates de-identified information to public authorities in tight turnaround times. Here, we describe our development of such an infrastructure, which sequenced 594,832 high coverage SARS-CoV-2 genomes from isolates we collected in the United States (U.S.) from March 13th 2020 to July 3rd 2023. Our sequencing protocol (‘Virseq’) utilizes wet and dry lab procedures to generate mutation-resistant sequencing of the entire SARS-CoV-2 genome, capturing all major lineages. We also characterize 379 clinically relevant SARS-CoV-2 multi-strain co-infections and ensure robust detection of emerging lineages via simulation. The modular infrastructure, sequencing, and analysis capabilities we describe support the U.S. Centers for Disease Control and Prevention national surveillance program and serve as a model for rapid response to emerging pandemics at a national scale.