BMC Genomics (Nov 2023)
Identification of plasmids in avian-associated Escherichia coli using nanopore and illumina sequencing
Abstract
Abstract Background Avian pathogenic Escherichia coli (APEC) are the causative agents of colibacillosis in chickens, a disease which has significant economic impact on the poultry industry. Large plasmids detected in APEC are known to contribute to strain diversity for pathogenicity and antimicrobial resistance, but there could be other plasmids that are missed in standard analysis. In this study, we determined the impact of sequencing and assembly factors for the detection of plasmids in an E. coli whole genome sequencing project. Results Hybrid assembly (Illumina and Nanopore) combined with plasmid DNA extractions allowed for detection of the greatest number of plasmids in E. coli, as detected by MOB-suite software. In total, 79 plasmids were identified in 19 E. coli isolates. Hybrid assemblies were robust and consistent in quality regardless of sequencing kit used or if long reads were filtered or not. In contrast, long read only assemblies were more variable and influenced by sequencing and assembly parameters. Plasmid DNA extractions allowed for the detection of physically smaller plasmids, but when averaged over 19 isolates did not significantly change the overall number of plasmids detected. Conclusions Hybrid assembly can be reliably used to detect plasmids in E. coli, especially if researchers are focused on large plasmids containing antimicrobial resistance genes and virulence factors. If the goal is comprehensive detection of all plasmids, particularly if smaller sized vectors are desired for biotechnology applications, the addition of plasmid DNA extractions to hybrid assemblies is prudent. Long read sequencing is sufficient to detect many plasmids in E. coli, however, it is more prone to errors when expanded to analyze a large number of isolates.
Keywords