Genome Biology (Feb 2019)

Skmer: assembly-free and alignment-free sample identification using genome skims

  • Shahab Sarmashghi,
  • Kristine Bohmann,
  • M. Thomas P. Gilbert,
  • Vineet Bafna,
  • Siavash Mirarab

DOI
https://doi.org/10.1186/s13059-019-1632-4
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 20

Abstract

The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computationally. This approach discards the nuclear DNA, which constitutes the vast majority of the data. In contrast, we suggest using all unassembled reads. We introduce an assembly-free and alignment-free tool, Skmer, to compute genomic distances between the query and reference genome skims. Skmer shows excellent accuracy in estimating distances and identifying the closest match in reference datasets.

Keywords