BMC Microbiology (Jun 2010)

<it>Escherichia coli </it>phylogenetic group determination and its application in the identification of the major animal source of fecal contamination

  • Amaral Luiz A,
  • Gomes Tânia AT,
  • Sato Maria IZ,
  • Hachich Elayse M,
  • Stoppe Nancy C,
  • Pires Mathias M,
  • Carlos Camila,
  • Ottoboni Laura MM

DOI
https://doi.org/10.1186/1471-2180-10-161
Journal volume & issue
Vol. 10, no. 1
p. 161

Abstract

Read online

Abstract Background Escherichia coli strains are commonly found in the gut microflora of warm-blooded animals. These strains can be assigned to one of the four main phylogenetic groups, A, B1, B2 and D, which can be divided into seven subgroups (A0, A1, B1, B22, B23, D1 and D2), according to the combination of the three genetic markers chuA, yjaA and DNA fragment TspE4.C2. Distinct studies have demonstrated that these phylo-groups differ in the presence of virulence factors, ecological niches and life-history. Therefore, the aim of this work was to analyze the distribution of these E. coli phylo-groups in 94 human strains, 13 chicken strains, 50 cow strains, 16 goat strains, 39 pig strains and 29 sheep strains and to verify the potential of this analysis to investigate the source of fecal contamination. Results The results indicated that the distribution of phylogenetic groups, subgroups and genetic markers is non-random in the hosts analyzed. Strains from group B1 were present in all hosts analyzed but were more prevalent in cow, goat and sheep samples. Subgroup B23 was only found in human samples. The diversity and the similarity indexes have indicated a similarity between the E. coli population structure of human and pig samples and among cow, goat and sheep samples. Correspondence analysis using contingence tables of subgroups, groups and genetic markers frequencies allowed the visualization of the differences among animal samples and the identification of the animal source of an external validation set. The classifier tools Binary logistic regression and Partial least square -- discriminant analysis, using the genetic markers profile of the strains, differentiated the herbivorous from the omnivorous strains, with an average error rate of 17%. Conclusions This is the first work, as far as we are aware, that identifies the major source of fecal contamination of a pool of strains instead of a unique strain. We concluded that the analysis of the E. coli population structure can be useful as a supplementary bacterial source tracking tool.