Genome Biology (Jul 2021)

STRONG: metagenomics strain resolution on assembly graphs

  • Christopher Quince,
  • Sergey Nurk,
  • Sebastien Raguideau,
  • Robert James,
  • Orkun S. Soyer,
  • J. Kimberly Summers,
  • Antoine Limasset,
  • A. Murat Eren,
  • Rayan Chikhi,
  • Aaron E. Darling

DOI
https://doi.org/10.1186/s13059-021-02419-7
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 34

Abstract

Read online

Abstract We introduce STrain Resolution ON assembly Graphs (STRONG), which identifies strains de novo, from multiple metagenome samples. STRONG performs coassembly, and binning into metagenome assembled genomes (MAGs), and stores the coassembly graph prior to variant simplification. This enables the subgraphs and their unitig per-sample coverages, for individual single-copy core genes (SCGs) in each MAG, to be extracted. A Bayesian algorithm, BayesPaths, determines the number of strains present, their haplotypes or sequences on the SCGs, and abundances. STRONG is validated using synthetic communities and for a real anaerobic digestor time series generates haplotypes that match those observed from long Nanopore reads.

Keywords