Communications Biology (Jun 2024)

Accelerating 3D genomics data analysis with Microcket

  • Yu Zhao,
  • Mengqi Yang,
  • Fanglei Gong,
  • Yuqi Pan,
  • Minghui Hu,
  • Qin Peng,
  • Leina Lu,
  • Xiaowen Lyu,
  • Kun Sun

DOI
https://doi.org/10.1038/s42003-024-06382-4
Journal volume & issue
Vol. 7, no. 1
pp. 1 – 7

Abstract

The three-dimensional (3D) organization of the genome is fundamental to cell biology. To explore the 3D genome, emerging high-throughput approaches have produced billions of sequencing reads, which are challenging and time-consuming to analyze. Here we present Microcket, a package for mapping and extracting interacting pairs from 3D genomics data, including Hi-C, Micro-C, and derivative protocols. Microcket uses a unique read-stitch strategy that takes advantage of the long read cycles in modern DNA sequencers. Benchmark evaluations reveal that Microcket runs much faster than current tools, with improved mapping efficiency, and thus shows high potential for accelerating and enhancing biological investigations into the 3D genome. Microcket is freely available at https://github.com/hellosunking/Microcket.