Applied Sciences (Jul 2022)

Macro SOStream: An Evolving Algorithm to Self Organizing Density-Based Clustering with Micro and Macroclusters

  • Andressa Stéfany Oliveira,
  • Rute Souza de Abreu,
  • Luiz Affonso Guedes

DOI
https://doi.org/10.3390/app12147161
Journal volume & issue
Vol. 12, no. 14
p. 7161

Abstract

This paper proposes Macro SOStream, a new evolving algorithm for data stream clustering that learns entirely online and is based on self-organizing density. Macro SOStream builds on the SOStream algorithm but adds macroclusters composed of microclusters. While microclusters are spherical, macroclusters can take arbitrary shapes. Macro SOStream also includes a macrocluster merge operation designed specifically to improve performance under data drift. To validate the proposal, we compare Macro SOStream with the SOStream and DenStream algorithms on four synthetic datasets, using the Adjusted Rand Index (ARI) as the performance metric. We also carry out an exhaustive analysis of how hyperparameter settings influence the performance of these algorithms. The results show that Macro SOStream performs well, particularly under data drift and when clusters are non-spherical.
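To illustrate how spherical microclusters can combine into macroclusters of arbitrary shape, the following minimal sketch groups microclusters whose spheres overlap. The abstract does not state the grouping rule, so the overlap criterion (centroid distance no greater than the sum of the radii), the function name `group_macroclusters`, and its parameters are assumptions made for illustration, not the authors' method.

```python
import math
from itertools import combinations


def group_macroclusters(centers, radii):
    """Assign a macrocluster label to each spherical microcluster.

    Hypothetical rule (an assumption, not taken from the paper): two
    microclusters belong to the same macrocluster when their spheres
    overlap, directly or through a chain of overlapping neighbours.
    """
    parent = list(range(len(centers)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(centers)), 2):
        # Spheres overlap when the centroid distance is at most r_i + r_j.
        if math.dist(centers[i], centers[j]) <= radii[i] + radii[j]:
            parent[find(i)] = find(j)

    # Relabel the union-find roots as consecutive integers 0, 1, 2, ...
    relabel = {}
    return [relabel.setdefault(find(i), len(relabel)) for i in range(len(centers))]


# Three microclusters chained along a line, plus one isolated microcluster.
centers = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 10.0)]
radii = [0.6, 0.6, 0.6, 0.6]
print(group_macroclusters(centers, radii))  # [0, 0, 0, 1]
```

The first and third microclusters do not overlap directly, yet they share a label because both overlap the second one. This chaining is what lets a macrocluster follow an elongated or curved region that no single sphere could cover.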

Keywords