PLoS Biology (Jul 2015)

Big Data: Astronomical or Genomical?

  • Zachary D Stephens,
  • Skylar Y Lee,
  • Faraz Faghri,
  • Roy H Campbell,
  • Chengxiang Zhai,
  • Miles J Efron,
  • Ravishankar Iyer,
  • Michael C Schatz,
  • Saurabh Sinha,
  • Gene E Robinson

DOI
https://doi.org/10.1371/journal.pbio.1002195
Journal volume & issue
Vol. 13, no. 7
p. e1002195

Abstract

Read online

Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast"--it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade.