mBio (Oct 2020)

An Early Pandemic Analysis of SARS-CoV-2 Population Structure and Dynamics in Arizona

  • Jason T. Ladner,
  • Brendan B. Larsen,
  • Jolene R. Bowers,
  • Crystal M. Hepp,
  • Evan Bolyen,
  • Megan Folkerts,
  • Krystal Sheridan,
  • Ashlyn Pfeiffer,
  • Hayley Yaglom,
  • Darrin Lemmer,
  • Jason W. Sahl,
  • Emily A. Kaelin,
  • Rabia Maqsood,
  • Nicholas A. Bokulich,
  • Grace Quirk,
  • Thomas D. Watts,
  • Kenneth K. Komatsu,
  • Victor Waddell,
  • Efrem S. Lim,
  • J. Gregory Caporaso,
  • David M. Engelthaler,
  • Michael Worobey,
  • Paul Keim

DOI
https://doi.org/10.1128/mBio.02107-20
Journal volume & issue
Vol. 11, no. 5

Abstract

ABSTRACT In December of 2019, a novel coronavirus, SARS-CoV-2, emerged in the city of Wuhan, China, causing severe morbidity and mortality. Since then, the virus has swept across the globe, causing millions of confirmed infections and hundreds of thousands of deaths. To better understand the nature of the pandemic and the introduction and spread of the virus in Arizona, we sequenced viral genomes from clinical samples tested at the TGen North Clinical Laboratory and the Arizona Department of Health Services, and from samples collected as part of community surveillance projects at Arizona State University and the University of Arizona. Phylogenetic analysis of 84 genomes from across Arizona revealed a minimum of 11 distinct introductions inferred to have occurred during February and March. We show that >80% of our sequences descend from strains that were initially circulating widely in Europe but have since dominated the outbreak in the United States. In addition, we show that the first reported case of community transmission in Arizona descended from the Washington state outbreak that was discovered in late February. Notably, none of the observed transmission clusters are epidemiologically linked to the original travel-related case in the state, suggesting successful early isolation and quarantine. Finally, we use molecular clock analyses to demonstrate a lack of identifiable, widespread cryptic transmission in Arizona prior to the middle of February 2020.

IMPORTANCE As the COVID-19 pandemic swept across the United States, its impact differed greatly among local and regional communities. One of the earliest and hardest-hit regions was New York, while at the same time Arizona (for example) had low incidence. That situation has changed dramatically, with Arizona now having the highest rate of disease increase in the country. Understanding the roots of the pandemic during the initial months is essential as the pandemic continues and reaches new heights. Genomic analysis and phylogenetic modeling of SARS-CoV-2 in Arizona can help to reconstruct population composition and predict the earliest undetected introductions. This foundational work represents the basis for future analysis and understanding as the pandemic continues.

Keywords