CloVR-Comparative: automated, cloud-enabled comparative microbial genome sequence analysis pipeline

Sonia Agrawal; Cesar Arze; Ricky S. Adkins; Jonathan Crabtree; David Riley; Mahesh Vangala; Kevin Galens; Claire M. Fraser; Hervé Tettelin; Owen White; Samuel V. Angiuoli; Anup Mahurkar; W. Florian Fricke

doi:10.1186/s12864-017-3717-3

BMC Genomics (Apr 2017)

CloVR-Comparative: automated, cloud-enabled comparative microbial genome sequence analysis pipeline

Sonia Agrawal,
Cesar Arze,
Ricky S. Adkins,
Jonathan Crabtree,
David Riley,
Mahesh Vangala,
Kevin Galens,
Claire M. Fraser,
Hervé Tettelin,
Owen White,
Samuel V. Angiuoli,
Anup Mahurkar,
W. Florian Fricke

Affiliations

Sonia Agrawal: Institute for Genome Sciences
Cesar Arze: Institute for Genome Sciences
Ricky S. Adkins: Institute for Genome Sciences
Jonathan Crabtree: Institute for Genome Sciences
David Riley: Institute for Genome Sciences
Mahesh Vangala: Institute for Genome Sciences
Kevin Galens: Institute for Genome Sciences
Claire M. Fraser: Institute for Genome Sciences
Hervé Tettelin: Institute for Genome Sciences
Owen White: Institute for Genome Sciences
Samuel V. Angiuoli: Institute for Genome Sciences
Anup Mahurkar: Institute for Genome Sciences
W. Florian Fricke: Institute for Genome Sciences

DOI: https://doi.org/10.1186/s12864-017-3717-3
Journal volume & issue: Vol. 18, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background The benefit of increasing genomic sequence data to the scientific community depends on easy-to-use, scalable bioinformatics support. CloVR-Comparative combines commonly used bioinformatics tools into an intuitive, automated, and cloud-enabled analysis pipeline for comparative microbial genomics. Results CloVR-Comparative runs on annotated complete or draft genome sequences that are uploaded by the user or selected via a taxonomic tree-based user interface and downloaded from NCBI. CloVR-Comparative runs reference-free multiple whole-genome alignments to determine unique, shared and core coding sequences (CDSs) and single nucleotide polymorphisms (SNPs). Output includes short summary reports and detailed text-based results files, graphical visualizations (phylogenetic trees, circular figures), and a database file linked to the Sybil comparative genome browser. Data up- and download, pipeline configuration and monitoring, and access to Sybil are managed through CloVR-Comparative web interface. CloVR-Comparative and Sybil are distributed as part of the CloVR virtual appliance, which runs on local computers or the Amazon EC2 cloud. Representative datasets (e.g. 40 draft and complete Escherichia coli genomes) are processed in <36 h on a local desktop or at a cost of <$20 on EC2. Conclusions CloVR-Comparative allows anybody with Internet access to run comparative genomics projects, while eliminating the need for on-site computational resources and expertise.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com

About the journal

Abstract

Keywords