LTU Attacker for Membership Inference

Joseph Pedersen; Rafael Muñoz-Gómez; Jiangnan Huang; Haozhe Sun; Wei-Wei Tu; Isabelle Guyon

doi:10.3390/a15070254

Algorithms (Jul 2022)

Affiliations

Joseph Pedersen: Department of Industrial and Systems Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180, USA
Rafael Muñoz-Gómez: LISN/CNRS/INRIA, Paris-Saclay University, 3 Rue Joliot Curie Bâtiment Breguet, 91190 Gif-sur-Yvette, France
Jiangnan Huang: LISN/CNRS/INRIA, Paris-Saclay University, 3 Rue Joliot Curie Bâtiment Breguet, 91190 Gif-sur-Yvette, France
Haozhe Sun: LISN/CNRS/INRIA, Paris-Saclay University, 3 Rue Joliot Curie Bâtiment Breguet, 91190 Gif-sur-Yvette, France
Wei-Wei Tu: 4Paradigm, 66 Qinghe Middle Street, Beijing 100089, China
Isabelle Guyon: LISN/CNRS/INRIA, Paris-Saclay University, 3 Rue Joliot Curie Bâtiment Breguet, 91190 Gif-sur-Yvette, France

DOI: https://doi.org/10.3390/a15070254
Journal volume & issue: Vol. 15, no. 7
p. 254

Abstract

We address the problem of defending predictive models, such as machine learning classifiers (Defender models), against membership inference attacks, in both the black-box and white-box settings, when the trainer and the trained model are publicly released. The Defender aims to optimize a dual objective: utility and privacy. Privacy is evaluated with the membership prediction error of a so-called “Leave-Two-Unlabeled” (LTU) Attacker, which has access to all of the Defender and Reserved data except for the membership label of one sample from each, giving the strongest possible attack scenario. We prove that, under certain conditions, even a “naïve” LTU Attacker can achieve lower bounds on privacy loss with simple attack strategies, leading to concrete necessary conditions to protect privacy, including preventing over-fitting and adding some amount of randomness. This attack is straightforward to implement against any model trainer, and we demonstrate its performance against MemGuard. However, we also show that such a naïve LTU Attacker can fail to attack the privacy of models known to be vulnerable in the literature, demonstrating that knowledge must be complemented with strong attack strategies to turn the LTU Attacker into a powerful means of evaluating privacy. The LTU Attacker can incorporate any existing attack strategy to compute individual privacy scores for each training sample. Our experiments on the QMNIST, CIFAR-10, and Location-30 datasets validate our theoretical results and confirm the roles of over-fitting prevention and randomness in the algorithms to protect against privacy attacks.
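
The LTU setup described in the abstract can be illustrated with a minimal sketch. It assumes a simple loss-comparison strategy (guess that the sample with the lower Defender loss is the training member), which is one possible naïve strategy and not necessarily the one used in the paper. The function names `ltu_pairwise_accuracy` and `attacker_error` are hypothetical, and the loss values in the usage note are made up for illustration.

```python
import numpy as np


def ltu_pairwise_accuracy(member_losses, reserved_losses):
    """Fraction of (Defender sample, Reserved sample) pairs labeled correctly.

    Assumed strategy (illustrative only): for each pair, the attacker guesses
    that the sample with the lower Defender-model loss is the training member.
    Ties are scored as a coin flip (0.5).
    """
    m = np.asarray(member_losses, dtype=float)[:, None]    # shape (n_members, 1)
    r = np.asarray(reserved_losses, dtype=float)[None, :]  # shape (1, n_reserved)
    correct = (m < r).astype(float) + 0.5 * (m == r)
    return float(correct.mean())


def attacker_error(member_losses, reserved_losses):
    """Membership prediction error of the assumed attacker over all pairs."""
    return 1.0 - ltu_pairwise_accuracy(member_losses, reserved_losses)
```

For example, with hypothetical per-sample losses `[0.1, 0.2]` on Defender data and `[0.3, 0.2]` on Reserved data, the attacker orders three of the four pairs correctly and ties on one, so `attacker_error` returns 0.125. An error near 0.5 would mean the attacker does no better than chance, while an error near 0 suggests the model's losses leak membership, as would be expected from an over-fitted model.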

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms
