Royal Society Open Science (Mar 2023)
Identifying the genetic basis of viral spillover using Lassa virus as a test case
Abstract
The rate at which zoonotic viruses spill over into the human population varies significantly over space and time. Remarkably, we do not yet know how much of this variation is attributable to genetic variation within viral populations. This gap in understanding arises because we lack methods of genetic analysis that can be easily applied to zoonotic viruses, where the number of available viral sequences is often limited, and opportunistic sampling introduces significant population stratification. Here, we explore the feasibility of using patterns of shared ancestry to correct for population stratification, enabling genome-wide association methods to identify genetic substitutions associated with spillover into the human population. Using a combination of phylogenetically structured simulations and Lassa virus sequences collected from humans and rodents in Sierra Leone, we demonstrate that existing methods do not fully correct for stratification, leading to elevated error rates. We also demonstrate, however, that the Type I error rate can be substantially reduced by confining the analysis to a less-stratified region of the phylogeny, even in an already-small dataset. Using this method, we detect two candidate single-nucleotide polymorphisms associated with spillover in the Lassa virus polymerase gene and provide generalized recommendations for the collection and analysis of zoonotic viruses.
Keywords