PeerJ (Jun 2015)

NxRepair: error correction in de novo sequence assembly using Nextera mate pairs

  • Rebecca R. Murphy,
  • Jared O’Connell,
  • Anthony J. Cox,
  • Ole Schulz-Trieglaff

DOI
https://doi.org/10.7717/peerj.996
Journal volume & issue
Vol. 3
p. e996

Abstract

Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large-scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identify and correct large-scale errors. We show that NxRepair can identify and correct large scaffolding errors without the use of a reference sequence, resulting in quantitative improvements in assembly quality. NxRepair can be downloaded from GitHub or PyPI, the Python Package Index; a tutorial and user documentation are also available.

Keywords