Scientific Reports (Dec 2023)

Neural networks memorise personal information from one sample

  • John Hartley
  • Pedro P. Sanchez
  • Fasih Haider
  • Sotirios A. Tsaftaris

DOI
https://doi.org/10.1038/s41598-023-48034-3
Journal volume & issue
Vol. 13, no. 1
pp. 1–13

Abstract

Deep neural networks (DNNs) have achieved high accuracy in diagnosing multiple diseases and conditions at large scale. However, concerns have been raised about safeguarding data privacy and about algorithmic bias in these models. We demonstrate that unique features (UFs), such as names, IDs, or other patient information, can be memorised (and eventually leaked) by neural networks even when they occur in only a single training sample within the dataset. We explain this memorisation phenomenon by showing that it is more likely to occur when a UF is an instance of a rare concept. We propose methods to identify whether a given model does or does not memorise a given (known) feature. Importantly, our method does not require access to the training data and can therefore be deployed by an external entity. We conclude that memorisation has implications for model robustness, but it can also pose a risk to the privacy of patients who consent to the use of their data for training models.
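
As a rough illustration of the kind of external check the abstract describes, the sketch below probes a trained classifier for sensitivity to a known candidate unique feature without touching the training data. This is a hypothetical, generic probe, not the authors' published method; the model, the probe images, and the `stamp_uf` stamping function are all stand-ins introduced here for illustration.

```python
# Hypothetical sketch (not the authors' method): measure how strongly a
# trained classifier reacts to a known candidate unique feature (UF),
# e.g. a patient ID stamped into an image corner, using only model access.
import torch

@torch.no_grad()
def uf_sensitivity(model, probe_images, stamp_uf):
    """Mean KL divergence between predictions on probe images with and
    without the candidate UF. A large, consistent shift suggests the model
    is sensitive to the feature, which may indicate memorisation."""
    model.eval()
    clean_logits = model(probe_images)
    stamped_logits = model(stamp_uf(probe_images))
    log_p_clean = torch.log_softmax(clean_logits, dim=-1)
    log_p_stamp = torch.log_softmax(stamped_logits, dim=-1)
    kl = torch.nn.functional.kl_div(log_p_stamp, log_p_clean,
                                    log_target=True, reduction="batchmean")
    return kl.item()

if __name__ == "__main__":
    # Toy stand-ins: a linear classifier and random probe images.
    model = torch.nn.Sequential(torch.nn.Flatten(),
                                torch.nn.Linear(28 * 28, 10))
    probe_images = torch.rand(32, 1, 28, 28)

    def stamp_uf(x):
        # Toy "unique feature": a fixed bright patch in the top-left corner.
        x = x.clone()
        x[:, :, :4, :4] = 1.0
        return x

    print("UF sensitivity:", uf_sensitivity(model, probe_images, stamp_uf))
```

In practice, such a score would be interpreted relative to a baseline, for example the sensitivity of models known not to have seen the feature, before drawing any conclusion about memorisation.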