Genome Biology (Apr 2024)

NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

  • Jiang Hu,
  • Zhuo Wang,
  • Zongyi Sun,
  • Benxia Hu,
  • Adeola Oluwakemi Ayoola,
  • Fan Liang,
  • Jingjing Li,
  • José R. Sandoval,
  • David N. Cooper,
  • Kai Ye,
  • Jue Ruan,
  • Chuan-Le Xiao,
  • Depeng Wang,
  • Dong-Dong Wu,
  • Sheng Wang

DOI
https://doi.org/10.1186/s13059-024-03252-4
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 19

Abstract

Read online

Abstract Long-read sequencing data, particularly those derived from the Oxford Nanopore sequencing platform, tend to exhibit high error rates. Here, we present NextDenovo, an efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. We apply NextDenovo to assemble 35 diverse human genomes from around the world using Nanopore long-read data. These genomes allow us to identify the landscape of segmental duplication and gene copy number variation in modern human populations. The use of NextDenovo should pave the way for population-scale long-read assembly using Nanopore long-read data.

Keywords