Computational and Structural Biotechnology Journal (Jan 2022)

KOMB: K-core based de novo characterization of copy number variation in microbiomes

  • Advait Balaji,
  • Nicolae Sapoval,
  • Charlie Seto,
  • R.A. Leo Elworth,
  • Yilei Fu,
  • Michael G. Nute,
  • Tor Savidge,
  • Santiago Segarra,
  • Todd J. Treangen

Journal volume & issue
Vol. 20
pp. 3208 – 3222

Abstract

Read online

Characterizing metagenomes via kmer-based, database-dependent taxonomic classification has yielded key insights into underlying microbiome dynamics. However, novel approaches are needed to track community dynamics and genomic flux within metagenomes, particularly in response to perturbations. We describe KOMB, a novel method for tracking genome level dynamics within microbiomes. KOMB utilizes K-core decomposition to identify Structural variations (SVs), specifically, population-level Copy Number Variation (CNV) within microbiomes. K-core decomposition partitions the graph into shells containing nodes of induced degree at least K, yielding reduced computational complexity compared to prior approaches. Through validation on a synthetic community, we show that KOMB recovers and profiles repetitive genomic regions in the sample. KOMB is shown to identify functionally-important regions in Human Microbiome Project datasets, and was used to analyze longitudinal data and identify keystone taxa in Fecal Microbiota Transplantation (FMT) samples. In summary, KOMB represents a novel graph-based, taxonomy-oblivious, and reference-free approach for tracking CNV within microbiomes. KOMB is open source and available for download athttps://gitlab.com/treangenlab/komb.

Keywords