BMC Bioinformatics (Dec 2018)

Towards practical privacy-preserving genome-wide association study

  • Charlotte Bonte,
  • Eleftheria Makri,
  • Amin Ardeshirdavani,
  • Jaak Simm,
  • Yves Moreau,
  • Frederik Vercauteren

DOI
https://doi.org/10.1186/s12859-018-2541-3
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background The deployment of Genome-wide association studies (GWASs) requires genomic information of a large population to produce reliable results. This raises significant privacy concerns, making people hesitate to contribute their genetic information to such studies. Results We propose two provably secure solutions to address this challenge: (1) a somewhat homomorphic encryption (HE) approach, and (2) a secure multiparty computation (MPC) approach. Unlike previous work, our approach does not rely on adding noise to the input data, nor does it reveal any information about the patients. Our protocols aim to prevent data breaches by calculating the χ 2 statistic in a privacy-preserving manner, without revealing any information other than whether the statistic is significant or not. Specifically, our protocols compute the χ 2 statistic, but only return a yes/no answer, indicating significance. By not revealing the statistic value itself but only the significance, our approach thwarts attacks exploiting statistic values. We significantly increased the efficiency of our HE protocols by introducing a new masking technique to perform the secure comparison that is necessary for determining significance. Conclusions We show that full-scale privacy-preserving GWAS is practical, as long as the statistics can be computed by low degree polynomials. Our implementations demonstrated that both approaches are efficient. The secure multiparty computation technique completes its execution in approximately 2 ms for data contributed by one million subjects.

Keywords