mSystems (Apr 2024)
Uncovering the boundaries of Campylobacter species through large-scale phylogenetic and nucleotide identity analyses
Abstract
ABSTRACTCampylobacter species are typically helical shaped, Gram-negative, and non-spore-forming bacteria. Species in this genus include established foodborne and animal pathogens as well as emerging pathogens. The accumulation of genomic data from the Campylobacter genus has increased exponentially in recent years, accompanied by the discovery of putative new species. At present, the lack of a standardized species boundary complicates distinguishing established and novel species. We defined the Campylobacter genus core genome (500 loci) using publicly available Campylobacter complete genomes (n = 498) and constructed a core genome phylogeny using 2,193 publicly available Campylobacter genomes to examine inter-species diversity and species boundaries. Utilizing 8,440 Campylobacter genomes representing 33 species and 8 subspecies, we found species delineation based on an average nucleotide identity (ANI) cutoff of 94.2% is consistent with the core genome phylogeny. We identified 60 ANI genomic species that delineated Campylobacter species in concordance with previous comparative genetic studies. All pairwise ANI genomic species pairs had in silico DNA-DNA hybridization scores of less than 70%, supporting their delineation as separate species. We provide the tool Campylobacter Genomic Species typer (CampyGStyper) that assigns ANI genomic species to query genomes based on ANI similarities to medoid genomes from each ANI genomic species with an accuracy of 99.96%. The ANI genomic species definitions proposed here allow consistent species definition in the Campylobacter genus and will facilitate the detection of novel species in the future.IMPORTANCEIn recent years, Campylobacter has gained recognition as the leading cause of bacterial gastroenteritis worldwide, leading to a substantial rise in the collection of genomic data of the Campylobacter genus in public databases. Currently, a standardized Campylobacter species boundary at the genomic level is absent, leading to challenges in detecting emerging pathogens and defining putative novel species within this genus. We used a comprehensive representation of genomes of the Campylobacter genus to construct a core genome phylogenetic tree. Furthermore, we found an average nucleotide identity (ANI) of 94.2% as the optimal cutoff to define the Campylobacter species. Using this cutoff, we identified 60 ANI genomic species which provided a standardized species definition and nomenclature. Importantly, we have developed Campylobacter Genomic Species typer (CampyGStyper), which can robustly and accurately assign these ANI genomic species to Campylobacter genomes, thereby aiding pathogen surveillance and facilitating evolutionary and epidemiological studies of existing and emerging pathogens in the genus Campylobacter.
Keywords