A scaling law to model the effectiveness of identification techniques

Luc Rocher; Julien M. Hendrickx; Yves-Alexandre de Montjoye

doi:10.1038/s41467-024-55296-6

Nature Communications (Jan 2025)

A scaling law to model the effectiveness of identification techniques

Luc Rocher,
Julien M. Hendrickx,
Yves-Alexandre de Montjoye

Affiliations

Luc Rocher: Oxford Internet Institute, University of Oxford
Julien M. Hendrickx: Information and Communication Technologies, Electronics and Applied Mathematics (ICTEAM), Université catholique de Louvain
Yves-Alexandre de Montjoye: Data Science Institute, Imperial College London

DOI: https://doi.org/10.1038/s41467-024-55296-6
Journal volume & issue: Vol. 16, no. 1
pp. 1 – 11

Abstract

Read online

Abstract AI techniques are increasingly being used to identify individuals both offline and online. However, quantifying their effectiveness at scale and, by extension, the risks they pose remains a significant challenge. Here, we propose a two-parameter Bayesian model for exact matching techniques and derive an analytical expression for correctness (κ), the fraction of people accurately identified in a population. We then generalize the model to forecast how κ scales from small-scale experiments to the real world, for exact, sparse, and machine learning-based robust identification techniques. Despite having only two degrees of freedom, our method closely fits 476 correctness curves and strongly outperforms curve-fitting methods and entropy-based rules of thumb. Our work provides a principled framework for forecasting the privacy risks posed by identification techniques, while also supporting independent accountability efforts for AI-based biometric systems.

Published in Nature Communications

ISSN: 2041-1723 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science
Website: https://www.nature.com/ncomms/

About the journal