SAGE Open (Oct 2011)

Avoiding Disclosure of Individually Identifiable Health Information

  • Sergio I. Prada,
  • Claudia González-Martínez,
  • Joshua Borton,
  • Johannes Fernandes-Huessy,
  • Craig Holden,
  • Elizabeth Hair,
  • and Tim Mulcahy

DOI
https://doi.org/10.1177/2158244011431279
Journal volume & issue
Vol. 1

Abstract

Read online

Achieving data and information dissemination without harming anyone is a central task of any entity in charge of collecting data. In this article, the authors examine the literature on data and statistical confidentiality. Rather than comparing the theoretical properties of specific methods, they emphasize the main themes that emerge from the ongoing discussion among scientists regarding how best to achieve the appropriate balance between data protection, data utility, and data dissemination. They cover the literature on de-identification and reidentification methods with emphasis on health care data. The authors also discuss the benefits and limitations for the most common access methods. Although there is abundant theoretical and empirical research, their review reveals lack of consensus on fundamental questions for empirical practice: How to assess disclosure risk, how to choose among disclosure methods, how to assess reidentification risk, and how to measure utility loss.