BMC Genomics (Nov 2024)

Removal of sequencing adapter contamination improves microbial genome databases

  • Andrew H. Moeller,
  • Brian A. Dillard,
  • Samantha L. Goldman,
  • Madalena V. F. Real,
  • Daniel D. Sprockett

DOI
https://doi.org/10.1186/s12864-024-10956-1
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 6

Abstract

Read online

Abstract Advances in assembling microbial genomes have led to growth of reference genome databases, which have been transformative for applied and basic microbiome research. Here we show that published microbial genome databases from humans, mice, cows, pigs, fish, honeybees, and marine environments contain significant sequencing-adapter contamination that systematically reduces assembly accuracy and contiguousness. By removing the adapter-contaminated ends of contiguous sequences and reassembling MGnify reference genomes, we improve the quality of assemblies in these databases.

Keywords