Frontiers in Neuroinformatics (Nov 2017)

Running Neuroimaging Applications on Amazon Web Services: How, When, and at What Cost?

  • Tara M. Madhyastha,
  • Natalie Koh,
  • Trevor K. M. Day,
  • Moises Hernández-Fernández,
  • Austin Kelley,
  • Daniel J. Peterson,
  • Sabreena Rajan,
  • Karl A. Woelfer,
  • Jonathan Wolf,
  • Thomas J. Grabowski,
  • Thomas J. Grabowski

DOI
https://doi.org/10.3389/fninf.2017.00063
Journal volume & issue
Vol. 11

Abstract

Read online

The contribution of this paper is to identify and describe current best practices for using Amazon Web Services (AWS) to execute neuroimaging workflows “in the cloud.” Neuroimaging offers a vast set of techniques by which to interrogate the structure and function of the living brain. However, many of the scientists for whom neuroimaging is an extremely important tool have limited training in parallel computation. At the same time, the field is experiencing a surge in computational demands, driven by a combination of data-sharing efforts, improvements in scanner technology that allow acquisition of images with higher image resolution, and by the desire to use statistical techniques that stress processing requirements. Most neuroimaging workflows can be executed as independent parallel jobs and are therefore excellent candidates for running on AWS, but the overhead of learning to do so and determining whether it is worth the cost can be prohibitive. In this paper we describe how to identify neuroimaging workloads that are appropriate for running on AWS, how to benchmark execution time, and how to estimate cost of running on AWS. By benchmarking common neuroimaging applications, we show that cloud computing can be a viable alternative to on-premises hardware. We present guidelines that neuroimaging labs can use to provide a cluster-on-demand type of service that should be familiar to users, and scripts to estimate cost and create such a cluster.

Keywords