Genome Biology (Jun 2021)

GUNC: detection of chimerism and contamination in prokaryotic genomes

  • Askarbek Orakov,
  • Anthony Fullam,
  • Luis Pedro Coelho,
  • Supriya Khedkar,
  • Damian Szklarczyk,
  • Daniel R. Mende,
  • Thomas S. B. Schmidt,
  • Peer Bork

DOI
https://doi.org/10.1186/s13059-021-02393-0
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 19

Abstract

Genomes are critical units in microbiology, yet ascertaining quality in prokaryotic genome assemblies remains a formidable challenge. We present GUNC (the Genome UNClutterer), a tool that accurately detects and quantifies genome chimerism based on the lineage homogeneity of individual contigs using a genome’s full complement of genes. GUNC complements existing approaches by targeting previously underdetected types of contamination: we conservatively estimate that 5.7% of genomes in GenBank, 5.2% in RefSeq, and 15–30% of pre-filtered “high-quality” metagenome-assembled genomes in recent studies are undetected chimeras. GUNC provides a fast and robust tool to substantially improve prokaryotic genome quality.

Keywords