Genome Biology (Jun 2022)

Storing and analyzing a genome on a blockchain

  • Gamze Gürsoy,
  • Charlotte M. Brannon,
  • Eric Ni,
  • Sarah Wagner,
  • Amol Khanna,
  • Mark Gerstein

DOI
https://doi.org/10.1186/s13059-022-02699-7
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 22

Abstract

There are major efforts underway to make genome sequencing a routine part of clinical practice. A critical barrier to these is achieving practical solutions for data ownership and integrity. Blockchain provides solutions to these challenges in other realms, such as finance. However, its use in genomics is stymied due to the difficulty in storing large-scale data on-chain, slow transaction speeds, and limitations on querying. To overcome these roadblocks, we developed a private blockchain network to store genomic variants and reference-aligned reads on-chain. It uses nested database indexing with an accompanying tool suite to rapidly access and analyze the data.

Keywords