PLoS ONE (Jan 2023)

First steps into the cloud: Using Amazon data storage and computing with Python notebooks.

  • Daniel J Pollak,
  • Gautam Chawla,
  • Andrey Andreev,
  • David A Prober

DOI
https://doi.org/10.1371/journal.pone.0278316
Journal volume & issue
Vol. 18, no. 2
p. e0278316

Abstract

Read online

With the oncoming age of big data, biologists are encountering more use cases for cloud-based computing to streamline data processing and storage. Unfortunately, cloud platforms are difficult to learn, and there are few resources for biologists to demystify them. We have developed a guide for experimental biologists to set up cloud processing on Amazon Web Services to cheaply outsource data processing and storage. Here we provide a guide for setting up a computing environment in the cloud and showcase examples of using Python and Julia programming languages. We present example calcium imaging data in the zebrafish brain and corresponding analysis using suite2p software. Tools for budget and user management are further discussed in the attached protocol. Using this guide, researchers with limited coding experience can get started with cloud-based computing or move existing coding infrastructure into the cloud environment.