eLife (Aug 2021)
Patterns of within-host genetic diversity in SARS-CoV-2
- Gerry Tonkin-Hill,
- Inigo Martincorena,
- Roberto Amato,
- Andrew RJ Lawson,
- Moritz Gerstung,
- Ian Johnston,
- David K Jackson,
- Naomi Park,
- Stefanie V Lensing,
- Michael A Quail,
- Sónia Gonçalves,
- Cristina Ariani,
- Michael Spencer Chapman,
- William L Hamilton,
- Luke W Meredith,
- Grant Hall,
- Aminu S Jahun,
- Yasmin Chaudhry,
- Myra Hosmillo,
- Malte L Pinckert,
- Iliana Georgana,
- Anna Yakovleva,
- Laura G Caller,
- Sarah L Caddy,
- Theresa Feltwell,
- Fahad A Khokhar,
- Charlotte J Houldcroft,
- Martin D Curran,
- Surendra Parmar,
- The COVID-19 Genomics UK (COG-UK) Consortium,
- Alex Alderton,
- Rachel Nelson,
- Ewan M Harrison,
- John Sillitoe,
- Stephen D Bentley,
- Jeffrey C Barrett,
- M Estee Torok,
- Ian G Goodfellow,
- Cordelia Langford,
- Dominic Kwiatkowski,
- Wellcome Sanger Institute COVID-19 Surveillance Team
Affiliations
- Gerry Tonkin-Hill
- ORCiD
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Inigo Martincorena
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Roberto Amato
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Andrew RJ Lawson
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Moritz Gerstung
- European Bioinformatics Institute, Hinxton, United Kingdom
- Ian Johnston
- Wellcome Sanger Institute, Hinxton, United Kingdom
- David K Jackson
- ORCiD
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Naomi Park
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Stefanie V Lensing
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Michael A Quail
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Sónia Gonçalves
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Cristina Ariani
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Michael Spencer Chapman
- ORCiD
- Wellcome Sanger Institute, Hinxton, United Kingdom
- William L Hamilton
- ORCiD
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- Luke W Meredith
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Grant Hall
- ORCiD
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Aminu S Jahun
- ORCiD
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Yasmin Chaudhry
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Myra Hosmillo
- ORCiD
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Malte L Pinckert
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Iliana Georgana
- ORCiD
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Anna Yakovleva
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Laura G Caller
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Sarah L Caddy
- ORCiD
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- Theresa Feltwell
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Fahad A Khokhar
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom; Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge, Cambridge, United Kingdom
- Charlotte J Houldcroft
- ORCiD
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- Martin D Curran
- Public Health England, Cambridge, United Kingdom
- Surendra Parmar
- Public Health England, Cambridge, United Kingdom
- The COVID-19 Genomics UK (COG-UK) Consortium
- Alex Alderton
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Rachel Nelson
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Ewan M Harrison
- Wellcome Sanger Institute, Hinxton, United Kingdom; European Bioinformatics Institute, Hinxton, United Kingdom
- John Sillitoe
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Stephen D Bentley
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Jeffrey C Barrett
- Wellcome Sanger Institute, Hinxton, United Kingdom
- M Estee Torok
- ORCiD
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- Ian G Goodfellow
- ORCiD
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
- Cordelia Langford
- Wellcome Sanger Institute, Hinxton, United Kingdom
- Dominic Kwiatkowski
- Wellcome Sanger Institute, Hinxton, United Kingdom; Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Wellcome Sanger Institute COVID-19 Surveillance Team
- DOI
- https://doi.org/10.7554/eLife.66857
- Journal volume & issue
-
Vol. 10
Abstract
Monitoring the spread of SARS-CoV-2 and reconstructing transmission chains has become a major public health focus for many governments around the world. The modest mutation rate and rapid transmission of SARS-CoV-2 prevents the reconstruction of transmission chains from consensus genome sequences, but within-host genetic diversity could theoretically help identify close contacts. Here we describe the patterns of within-host diversity in 1181 SARS-CoV-2 samples sequenced to high depth in duplicate. 95.1% of samples show within-host mutations at detectable allele frequencies. Analyses of the mutational spectra revealed strong strand asymmetries suggestive of damage or RNA editing of the plus strand, rather than replication errors, dominating the accumulation of mutations during the SARS-CoV-2 pandemic. Within- and between-host diversity show strong purifying selection, particularly against nonsense mutations. Recurrent within-host mutations, many of which coincide with known phylogenetic homoplasies, display a spectrum and patterns of purifying selection more suggestive of mutational hotspots than recombination or convergent evolution. While allele frequencies suggest that most samples result from infection by a single lineage, we identify multiple putative examples of co-infection. Integrating these results into an epidemiological inference framework, we find that while sharing of within-host variants between samples could help the reconstruction of transmission chains, mutational hotspots and rare cases of superinfection can confound these analyses.
Keywords