Nature Communications (Apr 2024)

KSNP: a fast de Bruijn graph-based haplotyping tool approaching data-in time cost

  • Qian Zhou,
  • Fahu Ji,
  • Dongxiao Lin,
  • Xianming Liu,
  • Zexuan Zhu,
  • Jue Ruan

DOI
https://doi.org/10.1038/s41467-024-47562-4
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 7

Abstract

Long reads, which cover more variants per read, create opportunities for accurate haplotype construction, whereas genotype errors in single nucleotide polymorphisms pose great computational challenges for haplotyping tools. Here we introduce KSNP, an efficient haplotype construction tool based on the de Bruijn graph (DBG). KSNP leverages the DBG's ability to handle high-throughput erroneous reads to tackle these challenges. Compared to other notable tools in this field, KSNP achieves at least a 5-fold speedup while producing comparable haplotype results. The time required for assembling human haplotypes is reduced to nearly the data-in time.