IEEE Access (Jan 2019)

Density Peaks Clustering Based on Local Minimal Spanning Tree

  • Renmin Wang,
  • Qingsheng Zhu

DOI
https://doi.org/10.1109/ACCESS.2019.2927757
Journal volume & issue
Vol. 7
pp. 108438 – 108446

Abstract

Read online

The fake center is a common problem of density-based clustering algorithms, especially for datasets with clusters of different shapes and densities. Clustering by fast search and find of density peaks (DPC) and its improved versions often ignore the effect of fake centers on clustering quality. They usually have a poor performance even the actual number of centers are used. To solve this problem, we propose a density peaks clustering based on local minimal spanning tree (DPC-LMST), which generates initial clusters for each potential centers first and then introduce a sub-cluster merging factor (SCMF) to aggregate similar sub-clusters. Meanwhile, we introduce a new strategy of representative points to reduce the size of data and redefine local density $\rho _{i}$ and distance $\delta _{i}$ of each representative point. Furthermore, the hint of $\gamma $ is redesigned to highlight true centers for datasets with clusters of different densities. The proposed algorithm is benchmarked on both synthetic and real-world datasets, and we compare the results with K-means, DPC, and the three state-of-the-art improved DPC algorithms.

Keywords