IEEE Access (Jan 2022)

Piecemeal Clustering: a Self-Driven Data Clustering Algorithm

  • Md. Monjur Ul Hasan,
  • Reza Shahidi,
  • Dennis K. Peters,
  • Lesley James,
  • Ray Gosine

DOI
https://doi.org/10.1109/ACCESS.2022.3228238
Journal volume & issue
Vol. 10
pp. 129985 – 130000

Abstract

Read online

Various approaches have been discussed in the literature for the clustering of data, such as partitioning, hierarchical, and machine learning methods. Most of the approaches require some prior knowledge about the clusters, such as their total number. Furthermore, some previous algorithms are not robust enough to process higher-dimensional data or require a large amount of memory for computations. We propose, herein, a data clustering algorithm, Piecemeal Clustering, that successfully clusters data without prior knowledge of the number of clusters. The proposed clustering algorithm uses the similarity and density of the data to identify the number of clusters in the data set and works with both low- and high-dimensional data. We demonstrate the power of the proposed Piecemeal Clustering algorithm with two real-world data sets. It is found that the proposed algorithm outperforms seven other state-of-the-art algorithms on both of these data sets.

Keywords