npj Viruses (Nov 2024)

Conservative taxonomy and quality assessment of giant virus genomes with GVClass

  • Thomas M. Pitot,
  • Tomáš Brůna,
  • Frederik Schulz

DOI
https://doi.org/10.1038/s44298-024-00069-7
Journal volume & issue
Vol. 2, no. 1
pp. 1 – 7

Abstract

Large double-stranded DNA viruses of the phylum Nucleocytoviricota (giant viruses; GVs) are the largest known viruses and infect a range of eukaryotic hosts, particularly protists and algae. These viruses influence biogeochemical cycles and host genome evolution, but their complex genomes make them difficult to identify and classify. We present GVClass, a tool that identifies giant viruses in sequence data, assigns taxonomy, and estimates genome completeness and contamination. GVClass combines optimized gene calling with a conservative classification approach: it builds single-protein phylogenies from highly conserved orthologous groups and assigns taxonomy from their consensus. Benchmarking shows over 90% accuracy at the genus level and over 99% at higher taxonomic ranks. GVClass is available as a standalone tool and is integrated into the Integrated Microbial Genomes/Virus database (IMG/VR).
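The idea of a conservative consensus over single-protein phylogenies can be illustrated with a minimal sketch. It is not GVClass's implementation: the function name `consensus_taxonomy`, the `min_agreement` parameter, the rank list, and the placeholder taxon labels are all assumptions made for illustration, and the agreement rule GVClass actually applies may differ. The sketch assumes each marker protein's tree has already been reduced to one lineage call, and it reports a rank only while the markers agree.

```python
from collections import Counter

# Ranks ordered from broadest to most specific (illustrative subset).
RANKS = ["phylum", "order", "family", "genus"]


def consensus_taxonomy(marker_calls, min_agreement=1.0):
    """Return the deepest lineage on which the marker calls agree.

    marker_calls: list of dicts mapping rank -> taxon label, one dict per
        marker protein (hypothetical input format).
    min_agreement: fraction of markers that must share the top label at a
        rank; 1.0 means unanimity (an assumed, deliberately strict default).
    """
    lineage = {}
    for rank in RANKS:
        labels = [call[rank] for call in marker_calls if call.get(rank)]
        if not labels:
            break  # no marker reaches this rank
        label, count = Counter(labels).most_common(1)[0]
        if count / len(labels) < min_agreement:
            break  # disagreement: stop here instead of guessing
        lineage[rank] = label
    return lineage


# Three hypothetical marker calls that disagree only at the genus rank.
calls = [
    {"phylum": "Nucleocytoviricota", "order": "order_A",
     "family": "family_A", "genus": "genus_1"},
    {"phylum": "Nucleocytoviricota", "order": "order_A",
     "family": "family_A", "genus": "genus_1"},
    {"phylum": "Nucleocytoviricota", "order": "order_A",
     "family": "family_A", "genus": "genus_2"},
]

print(consensus_taxonomy(calls))
```

With the unanimity default, the example stops at the family rank because one marker points to a different genus. Lowering `min_agreement` trades that caution for deeper assignments.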