F1000Research (Mar 2017)

Picopore: A tool for reducing the storage size of Oxford Nanopore Technologies datasets without loss of functionality [version 1; referees: 2 approved]

  • Scott Gigante

DOI
https://doi.org/10.12688/f1000research.11022.1
Journal volume & issue
Vol. 6

Abstract

Read online

Oxford Nanopore Technologies' (ONT) MinION and PromethION long-read sequencing technologies are emerging as genuine alternatives to established Next-Generation Sequencing technologies. A combination of the highly redundant file format and a rapid increase in data generation have created a significant problem both for immediate data storage on MinION-capable laptops, and for long-term storage on lab data servers. We developed Picopore, a software suite offering three methods of compression. Picopore's lossless and deep lossless methods provide a 25% and 44% average reduction in size, respectively, without removing any data from the files. Picopore's raw method provides an 88% average reduction in size, while retaining biologically relevant data for the end-user. All methods have the capacity to run in real-time in parallel to a sequencing run, reducing demand for both immediate and long-term storage space.

Keywords