IEEE Access (Jan 2023)

A New Density Peak Clustering Algorithm With Adaptive Clustering Center Based on Differential Privacy

  • Hua Chen,
  • Yuan Zhou,
  • Kehui Mei,
  • Nan Wang,
  • Guangxing Cai

DOI
https://doi.org/10.1109/ACCESS.2022.3233196
Journal volume & issue
Vol. 11
pp. 1418 – 1431

Abstract

A new density peak clustering (DPC) algorithm with adaptive clustering centers, based on differential privacy, is proposed to address three problems in clustering analysis: poor adaptability to high-dimensional data, the inability to determine clustering centers automatically, and privacy problems. First, to improve adaptability to high-dimensional data, cosine distance is used to measure similarity in high-dimensional datasets. Second, to remove the subjectivity of clustering center selection, the weight $(i-1)/i$ is introduced into the ranking graph and the slope trend of the ranking graph is redefined, so that clustering centers are selected adaptively. Finally, to address the privacy problem, Laplacian noise with an appropriate privacy budget is added to the core statistic of the algorithm (local density) to balance privacy protection against algorithm effectiveness. Experimental results on both synthetic and UCI datasets show that the algorithm selects clustering centers automatically, addresses the privacy problem in clustering analysis, and greatly improves the clustering evaluation indices, which demonstrates its effectiveness.
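
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the standard DPC quantities (local density rho, distance delta to the nearest point of higher density, and ranking score gamma = rho * delta), a cutoff-kernel density with a hypothetical cutoff `d_c`, and a sensitivity of 1 for the neighbor count. The function names and the parameters `d_c`, `epsilon`, and `sensitivity` are illustrative. The $(i-1)/i$ weighted slope rule is not reproduced, because the abstract does not define it.

```python
import numpy as np

def cosine_distance_matrix(X):
    """Pairwise cosine distance 1 - cos(x_i, x_j) between the rows of X."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    unit = X / np.clip(norms, 1e-12, None)  # guard against zero vectors
    D = 1.0 - unit @ unit.T
    np.fill_diagonal(D, 0.0)
    return np.clip(D, 0.0, 2.0)

def noisy_local_density(D, d_c, epsilon, rng, sensitivity=1.0):
    """Cutoff-kernel density plus Laplace noise of scale sensitivity / epsilon.

    Assumption: the density is a neighbor count with sensitivity 1.
    """
    rho = (D < d_c).sum(axis=1) - 1.0  # subtract 1 to exclude the point itself
    noise = rng.laplace(loc=0.0, scale=sensitivity / epsilon, size=rho.shape)
    return rho + noise

def delta_distance(D, rho):
    """Distance from each point to its nearest neighbor of higher density."""
    delta = np.empty(len(rho))
    for i in range(len(rho)):
        higher = rho > rho[i]
        # A point with the highest density gets its largest distance instead.
        delta[i] = D[i, higher].min() if higher.any() else D[i].max()
    return delta

def ranking_scores(rho, delta):
    """Indices sorted by descending gamma = rho * delta, with the sorted gamma."""
    gamma = rho * delta
    order = np.argsort(-gamma)
    return order, gamma[order]
```

A smaller `epsilon` gives a larger noise scale and therefore stronger privacy at the cost of a less reliable density estimate, which is the trade-off the abstract refers to. Note that the noisy density can become negative for sparse points, so a real implementation would need to decide how to handle that case.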

Keywords