PLoS ONE (Jan 2014)

STORMSeq: an open-source, user-friendly pipeline for processing personal genomics data in the cloud.

  • Konrad J Karczewski,
  • Guy Haskin Fernald,
  • Alicia R Martin,
  • Michael Snyder,
  • Nicholas P Tatonetti,
  • Joel T Dudley

DOI
https://doi.org/10.1371/journal.pone.0084860
Journal volume & issue
Vol. 9, no. 1
p. e84860

Abstract

Read online

The increasing public availability of personal complete genome sequencing data has ushered in an era of democratized genomics. However, read mapping and variant calling software is constantly improving and individuals with personal genomic data may prefer to customize and update their variant calls. Here, we describe STORMSeq (Scalable Tools for Open-Source Read Mapping), a graphical interface cloud computing solution that does not require a parallel computing environment or extensive technical experience. This customizable and modular system performs read mapping, read cleaning, and variant calling and annotation. At present, STORMSeq costs approximately $2 and 5-10 hours to process a full exome sequence and $30 and 3-8 days to process a whole genome sequence. We provide this open-access and open-source resource as a user-friendly interface in Amazon EC2.