Genome Biology (Sep 2023)

COLLAGENE enables privacy-aware federated and collaborative genomic data analysis

  • Wentao Li,
  • Miran Kim,
  • Kai Zhang,
  • Han Chen,
  • Xiaoqian Jiang,
  • Arif Harmanci

DOI
https://doi.org/10.1186/s13059-023-03039-z
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 38

Abstract

Read online

Abstract Growing regulatory requirements set barriers around genetic data sharing and collaborations. Moreover, existing privacy-aware paradigms are challenging to deploy in collaborative settings. We present COLLAGENE, a tool base for building secure collaborative genomic data analysis methods. COLLAGENE protects data using shared-key homomorphic encryption and combines encryption with multiparty strategies for efficient privacy-aware collaborative method development. COLLAGENE provides ready-to-run tools for encryption/decryption, matrix processing, and network transfers, which can be immediately integrated into existing pipelines. We demonstrate the usage of COLLAGENE by building a practical federated GWAS protocol for binary phenotypes and a secure meta-analysis protocol. COLLAGENE is available at https://zenodo.org/record/8125935 .

Keywords