MOCAT: a metagenomics assembly and gene prediction toolkit.

Jens Roat Kultima; Shinichi Sunagawa; Junhua Li; Weineng Chen; Hua Chen; Daniel R Mende; Manimozhiyan Arumugam; Qi Pan; Binghang Liu; Junjie Qin; Jun Wang; Jun Wang; Peer Bork

doi:10.1371/journal.pone.0047656

PLoS ONE (Jan 2012)

DOI: https://doi.org/10.1371/journal.pone.0047656
Journal volume & issue: Vol. 7, no. 10
p. e47656

Abstract

MOCAT is a highly configurable, modular pipeline for fast, standardized processing of single-end or paired-end sequencing data generated by the Illumina platform. The pipeline uses state-of-the-art programs to quality-control, map, and assemble reads from metagenomic samples sequenced at a depth of several billion base pairs, and to predict protein-coding genes on assembled metagenomes. Mapping against reference databases allows for read extraction or removal, as well as abundance calculations. Relevant statistics for each processing step can be summarized into multi-sheet Excel documents and queryable SQL databases. MOCAT runs on UNIX machines and integrates seamlessly with the SGE and PBS queuing systems, which are commonly used to process large datasets. The open source code and modular architecture allow users to modify or exchange the programs used in the various processing steps. Individual processing steps and parameters were benchmarked and tested on artificial, real, and simulated metagenomes, resulting in improvements in selected quality metrics. MOCAT can be freely downloaded at http://www.bork.embl.de/mocat/.
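To make the idea of abundance calculation from read mappings concrete, the following is a minimal Python sketch. It is not MOCAT's implementation: the function name `abundance_from_mappings`, the tuple layout of the input, and the toy data are all assumptions made for illustration. It shows one common approach, counting the reads mapped to each reference sequence and dividing the aligned bases by the reference length to obtain a length-normalized coverage.

```python
from collections import defaultdict


def abundance_from_mappings(mappings, reference_lengths):
    """Summarize read mappings per reference sequence.

    Hypothetical helper, not part of MOCAT.
    mappings: iterable of (read_id, reference_id, aligned_bases) tuples.
    reference_lengths: dict of reference_id -> length in base pairs.
    Returns a dict of reference_id -> {"reads": int, "coverage": float}.
    """
    read_counts = defaultdict(int)
    aligned_bases = defaultdict(int)
    for _read_id, ref_id, n_bases in mappings:
        read_counts[ref_id] += 1
        aligned_bases[ref_id] += n_bases

    result = {}
    for ref_id, length in reference_lengths.items():
        result[ref_id] = {
            # Number of reads mapped to this reference (0 if none).
            "reads": read_counts[ref_id],
            # Aligned bases divided by reference length.
            "coverage": aligned_bases[ref_id] / length,
        }
    return result


# Toy input (assumed values, for illustration only).
example_mappings = [
    ("read1", "geneA", 100),
    ("read2", "geneA", 100),
    ("read3", "geneB", 50),
]
example_lengths = {"geneA": 1000, "geneB": 500, "geneC": 2000}
example_abundance = abundance_from_mappings(example_mappings, example_lengths)
```

Normalizing by reference length keeps long references from appearing more abundant simply because they attract more reads. References with no mapped reads are reported with zero counts rather than omitted.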

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/
