PLoS ONE (Jan 2015)

Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus.

  • Elizabeth M Driebe,
  • Jason W Sahl,
  • Chandler Roe,
  • Jolene R Bowers,
  • James M Schupp,
  • John D Gillece,
  • Erin Kelley,
  • Lance B Price,
  • Talima R Pearson,
  • Crystal M Hepp,
  • Pius M Brzoska,
  • Craig A Cummings,
  • Manohar R Furtado,
  • Paal S Andersen,
  • Marc Stegger,
  • David M Engelthaler,
  • Paul S Keim

DOI
https://doi.org/10.1371/journal.pone.0130955
Journal volume & issue
Vol. 10, no. 7
p. e0130955

Abstract

Read online

Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism's phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP)-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss.