PLoS ONE (Jan 2021)

Generalized linear models provide a measure of virulence for specific mutations in SARS-CoV-2 strains.

  • Anastasis Oulas,
  • Maria Zanti,
  • Marios Tomazou,
  • Margarita Zachariou,
  • George Minadakis,
  • Marilena M Bourdakou,
  • Pavlos Pavlidis,
  • George M Spyrou

DOI
https://doi.org/10.1371/journal.pone.0238665
Journal volume & issue
Vol. 16, no. 1
p. e0238665

Abstract

Read online

This study aims to highlight SARS-COV-2 mutations which are associated with increased or decreased viral virulence. We utilize genetic data from all strains available from GISAID and countries' regional information, such as deaths and cases per million, as well as COVID-19-related public health austerity measure response times. Initial indications of selective advantage of specific mutations can be obtained from calculating their frequencies across viral strains. By applying modelling approaches, we provide additional information that is not evident from standard statistics or mutation frequencies alone. We therefore, propose a more precise way of selecting informative mutations. We highlight two interesting mutations found in genes N (P13L) and ORF3a (Q57H). The former appears to be significantly associated with decreased deaths and cases per million according to our models, while the latter shows an opposing association with decreased deaths and increased cases per million. Moreover, protein structure prediction tools show that the mutations infer conformational changes to the protein that significantly alter its structure when compared to the reference protein.