mSystems (Jun 2020)

Advantages and Limits of Metagenomic Assembly and Binning of a Giant Virus

  • Frederik Schulz,
  • Julien Andreani,
  • Rania Francis,
  • Hadjer Boudjemaa,
  • Jacques Yaacoub Bou Khalil,
  • Janey Lee,
  • Bernard La Scola,
  • Tanja Woyke

DOI
https://doi.org/10.1128/mSystems.00048-20
Journal volume & issue
Vol. 5, no. 3

Abstract

ABSTRACT Giant viruses have large genomes, often within the size range of cellular organisms. This distinguishes them from most other viruses and demands additional effort for the successful recovery of their genomes from environmental sequence data. Here, we tested the performance of genome-resolved metagenomics on a recently isolated giant virus, Fadolivirus, by spiking it into an environmental sample from which two other giant viruses were isolated. At high spike-in levels, metagenome assembly and binning led to the successful genomic recovery of Fadolivirus from the sample. A complementary survey of the major capsid protein indicated the presence of other giant viruses in the sample matrix but did not detect the two isolated from this sample. Our results indicate that genome-resolved metagenomics is a valid approach for the recovery of near-complete giant virus genomes given that sufficient clonal particles are present. However, our data also underline that a vast majority of giant viruses remain currently undetected, even in an era of terabase-scale metagenomics.

IMPORTANCE The discovery of large and giant nucleocytoplasmic large DNA viruses (NCLDV) with genomes in the megabase range and equipped with a wide variety of features typically associated with cellular organisms was one of the most unexpected, intriguing, and spectacular breakthroughs in virology. Recent studies suggest that these viruses are highly abundant in the oceans, freshwater, and soil, impact the biology and ecology of their eukaryotic hosts, and ultimately affect global nutrient cycles. Genome-resolved metagenomics is becoming an increasingly popular tool to assess the diversity and coding potential of giant viruses, but this approach is currently lacking validation.

Keywords