mBio (May 2013)

Evolutionary Genomics of Salmonella enterica Subspecies

  • Prerak T. Desai,
  • Steffen Porwollik,
  • Fred Long,
  • Pui Cheng,
  • Aye Wollam,
  • Sandra W. Clifton,
  • George M. Weinstock,
  • Michael McClelland

DOI
https://doi.org/10.1128/mBio.00579-12
Journal volume & issue
Vol. 4, no. 2

Abstract

ABSTRACT Six subspecies are currently recognized in Salmonella enterica. Subspecies I (subspecies enterica) is responsible for nearly all infections in humans and warm-blooded animals, while the five other subspecies are isolated principally from cold-blooded animals. We sequenced 21 phylogenetically diverse strains, including two representatives from each of the five previously unsequenced subspecies and 11 diverse new strains from S. enterica subspecies enterica, to put this species into an evolutionary perspective. The phylogeny of the subspecies was partly obscured by abundant recombination events between lineages and by the relatively short period of time within which subspeciation took place. Nevertheless, a variety of tree-building methods gave congruent evolutionary tree topologies for subspeciation. A total of 285 gene families were identified that were recruited into subspecies enterica, and most of these are of unknown function. At least 2,807 gene families were identified in one or more of the other subspecies that are not found in subspecies I or Salmonella bongori. Among these gene families were 13 new candidate effectors and 7 new candidate fimbrial clusters. A third complete type III secretion system not present in subspecies enterica (I) isolates was found in both strains of subspecies salamae (II). Some gene families had complex taxonomies, such as the type VI secretion systems, which were recruited from four different lineages in five of six subspecies. Analysis of nonsynonymous-to-synonymous substitution rates indicated that the more recently acquired regions in S. enterica are undergoing faster fixation rates than the rest of the genome. Recently acquired AT-rich regions, which often encode virulence functions, are under ongoing selection to maintain their high AT content.
IMPORTANCE We have sequenced 21 new genomes which encompass the phylogenetic diversity of Salmonella, including strains of the previously unsequenced subspecies arizonae, diarizonae, houtenae, salamae, and indica as well as new diverse strains of subspecies enterica. We have deduced possible evolutionary paths traversed by this very important zoonotic pathogen and identified novel putative virulence factors that are not found in subspecies I. Gene families gained at the time of the evolution of subspecies enterica are of particular interest because they include mechanisms by which this subspecies adapted to warm-blooded hosts.