Jupyter and Galaxy: Easing entry barriers into complex data analyses for biomedical researchers.

Björn A Grüning; Eric Rasche; Boris Rebolledo-Jaramillo; Carl Eberhard; Torsten Houwaart; John Chilton; Nate Coraor; Rolf Backofen; James Taylor; Anton Nekrutenko

doi:10.1371/journal.pcbi.1005425

PLoS Computational Biology (May 2017)

Jupyter and Galaxy: Easing entry barriers into complex data analyses for biomedical researchers.

Björn A Grüning,
Eric Rasche,
Boris Rebolledo-Jaramillo,
Carl Eberhard,
Torsten Houwaart,
John Chilton,
Nate Coraor,
Rolf Backofen,
James Taylor,
Anton Nekrutenko

Affiliations

Björn A Grüning
Eric Rasche
Boris Rebolledo-Jaramillo
Carl Eberhard
Torsten Houwaart
John Chilton
Nate Coraor
Rolf Backofen
James Taylor
Anton Nekrutenko

DOI: https://doi.org/10.1371/journal.pcbi.1005425
Journal volume & issue: Vol. 13, no. 5
p. e1005425

Abstract

Read online

What does it take to convert a heap of sequencing data into a publishable result? First, common tools are employed to reduce primary data (sequencing reads) to a form suitable for further analyses (i.e., the list of variable sites). The subsequent exploratory stage is much more ad hoc and requires the development of custom scripts and pipelines, making it problematic for biomedical researchers. Here, we describe a hybrid platform combining common analysis pathways with the ability to explore data interactively. It aims to fully encompass and simplify the "raw data-to-publication" pathway and make it reproducible.

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/

About the journal