Assessing the Reidentification Risks Posed by Deep Learning Algorithms Applied to ECG Data

Arin Ghazarian; Jianwei Zheng; Daniele Struppa; Cyril Rakovski

doi:10.1109/ACCESS.2022.3185615

IEEE Access (Jan 2022)

Affiliations

Arin Ghazarian: Schmid College of Science and Technology, Chapman University, Orange, CA, USA
Jianwei Zheng: Schmid College of Science and Technology, Chapman University, Orange, CA, USA
Daniele Struppa: Schmid College of Science and Technology, Chapman University, Orange, CA, USA
Cyril Rakovski: Schmid College of Science and Technology, Chapman University, Orange, CA, USA

DOI: https://doi.org/10.1109/ACCESS.2022.3185615
Journal volume & issue: Vol. 10
pp. 68711 – 68723

Abstract

Electrocardiogram (ECG) data analysis is one of the most widely used and important tools in cardiology diagnostics. In recent years, the development of advanced deep learning techniques and GPU hardware has made it possible to train neural network models that attain exceptionally high levels of accuracy in complex tasks such as heart disease diagnosis and treatment. We investigate the use of ECGs as biometrics in human identification systems by implementing state-of-the-art deep learning models. We train convolutional neural network models on approximately 81k patients from the US, Germany and China. Currently, this is the largest research project on ECG identification. Our models achieved an overall accuracy of 95.69%. Furthermore, we assessed the accuracy of our ECG identification model for distinct groups of patients with particular heart conditions and combinations of such conditions. For example, identification accuracy was highest (99.7%) for patients with both ST changes and supraventricular tachycardia, and lowest (49%) for patients diagnosed with both atrial fibrillation and complete right bundle branch block. We discuss the implications of these findings for the risk of reidentifying patients from ECG data, and how seemingly anonymized ECG datasets can raise privacy concerns for patients.
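
The subgroup comparison described in the abstract amounts to computing identification accuracy once over all test recordings and once per diagnostic group. The following is a minimal sketch of that bookkeeping only, not the authors' code: the function name `accuracy_by_group`, the patient IDs, and the group labels are hypothetical, and the predicted IDs are assumed to come from some already-trained identification model.

```python
from collections import defaultdict


def accuracy_by_group(true_ids, predicted_ids, groups):
    """Return (overall, per_group) top-1 identification accuracy.

    true_ids      -- the patient ID each test recording belongs to
    predicted_ids -- the patient ID the model assigned to each recording
    groups        -- a diagnostic group label for each recording
                     (hypothetical labels; any hashable value works)
    """
    hits = defaultdict(int)    # correct identifications per group
    totals = defaultdict(int)  # recordings per group
    for true_id, pred_id, group in zip(true_ids, predicted_ids, groups):
        totals[group] += 1
        hits[group] += int(true_id == pred_id)

    overall = sum(hits.values()) / sum(totals.values())
    per_group = {g: hits[g] / totals[g] for g in totals}
    return overall, per_group


# Toy data, invented purely for illustration.
true_ids = ["p1", "p2", "p3", "p4"]
predicted_ids = ["p1", "p2", "p9", "p4"]
groups = ["group_a", "group_a", "group_b", "group_b"]

overall, per_group = accuracy_by_group(true_ids, predicted_ids, groups)
print(overall)    # 0.75
print(per_group)  # {'group_a': 1.0, 'group_b': 0.5}
```

A low per-group value in such a breakdown means recordings from that group are harder to link back to an individual, while a high value points to a greater reidentification risk for that group.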

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
