BMC Medical Genomics (Jul 2020)

Privately computing set-maximal matches in genomic data

  • Katerina Sotiraki,
  • Esha Ghosh,
  • Hao Chen

DOI
https://doi.org/10.1186/s12920-020-0718-x
Journal volume & issue
Vol. 13, no. S7
pp. 1 – 8

Abstract

Read online

Abstract Background Finding long matches in deoxyribonucleic acid (DNA) sequences in large aligned genetic sequences is a problem of great interest. A paradigmatic application is the identification of distant relatives via large common subsequences in DNA data. However, because of the sensitive nature of genomic data such computations without security consideration might compromise the privacy of the individuals involved. Methods The secret sharing technique enables the computation of matches while respecting the privacy of the inputs of the parties involved. This method requires interaction that depends on the circuit depth needed for the computation. Results We design a new depth-optimized algorithm for computing set-maximal matches between a database of aligned genetic sequences and the DNA of an individual while respecting the privacy of both the database owner and the individual. We then implement and evaluate our protocol. Conclusions Using modern cryptographic techniques, difficult genomic computations are performed in a privacy-preserving way. We enrich this research area by proposing a privacy-preserving protocol for set-maximal matches.

Keywords