Electronics Letters (Dec 2022)

High‐frequency k‐mer counting at low memory footprint

  • Li Mocheng,
  • Liu Yang,
  • Xiao Nong,
  • Chen Zhiguang

DOI
https://doi.org/10.1049/ell2.12661
Journal volume & issue
Vol. 58, no. 25
pp. 940 – 942

Abstract

Genomic data analysis requires efficient tools to handle the vast amount of data generated by current next‐generation sequencing technologies. Existing k‐mer counting tools struggle to balance memory overhead against statistical precision. We designed a high‐frequency k‐mer counting method based on the Space Saving algorithm and a novel hash table structure, which reduces memory overhead by 46% while maintaining high computational efficiency.
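
For readers unfamiliar with the Space Saving algorithm named in the abstract, the following is a minimal sketch of the textbook version applied to a k‐mer stream. It is not the authors' implementation: the abstract does not describe their hash table structure, so a plain Python dict stands in for it, and the names `kmers`, `space_saving`, and `capacity` are illustrative assumptions.

```python
def kmers(seq, k):
    """Yield every length-k substring (k-mer) of seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def space_saving(stream, capacity):
    """Textbook Space Saving: track at most `capacity` items.

    Returns (counts, errors). counts[x] never underestimates the true
    frequency of x; errors[x] bounds how much it may overestimate.
    A plain dict is used here as a stand-in (assumption); it is not
    the hash table structure from the paper.
    """
    counts = {}
    errors = {}
    for item in stream:
        if item in counts:
            # Already monitored: just increment.
            counts[item] += 1
        elif len(counts) < capacity:
            # Free slot available: start monitoring with an exact count.
            counts[item] = 1
            errors[item] = 0
        else:
            # Table full: evict the item with the smallest counter and
            # let the newcomer inherit that counter plus one.
            victim = min(counts, key=counts.get)
            floor = counts.pop(victim)
            errors.pop(victim)
            counts[item] = floor + 1
            errors[item] = floor
    return counts, errors


if __name__ == "__main__":
    counts, errors = space_saving(kmers("ACGTACGTACGA", 3), capacity=4)
    for kmer, c in sorted(counts.items(), key=lambda kv: -kv[1]):
        print(kmer, c, "max overestimate:", errors[kmer])
```

Memory is bounded by `capacity` regardless of how many distinct k‐mers appear, which is the property that makes the approach attractive for high‐frequency k‐mer counting. The linear scan for the minimum counter keeps the sketch short; practical implementations typically use a structure that finds the minimum in constant time.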