Applied Sciences (Aug 2024)

A Differentially Private (Random) Decision Tree without Noise from <i>k</i>-Anonymity

  • Atsushi Waseda,
  • Ryo Nojima,
  • Lihua Wang

DOI
https://doi.org/10.3390/app14177625
Journal volume & issue
Vol. 14, no. 17
p. 7625

Abstract

Read online

This paper focuses on the relationship between decision trees, a typical machine learning method, and data anonymization. It is known that information leaked from trained decision trees can be evaluated using well-studied data anonymization techniques and that decision trees can be strengthened using k-anonymity and ℓ-diversity; unfortunately, however, this does not seem sufficient for differential privacy. In this paper, we show how one might apply k-anonymity to a (random) decision tree, which is a variant of the decision tree. Surprisingly, this results in differential privacy, which means that security is amplified from k-anonymity to differential privacy without the addition of noise.

Keywords