IEEE Access (Jan 2024)

Meta-iCVI: Ensemble Validity Metrics for Concise Labeling of Correct, Under- or Over-Partitioning in Streaming Clustering

  • Niklas M. Melton,
  • Sasha A. Petrenko,
  • Donald C. Wunsch

DOI
https://doi.org/10.1109/ACCESS.2023.3346058
Journal volume & issue
Vol. 12
pp. 11114 – 11124

Abstract

Understanding the performance and validity of clustering algorithms is both challenging and crucial, particularly when clustering must be done online. Until recently, most validation methods have relied on batch calculation and have required considerable human expertise in their interpretation. Improving real-time performance and interpretability of cluster validation, therefore, continues to be an important theme in unsupervised learning. Building upon previous work on incremental cluster validity indices (iCVIs), this paper introduces the Meta-iCVI as a tool for explainable and concise labeling of partition quality in online clustering. Leveraging a time-series classifier and data-fusion techniques, the Meta-iCVI combines the outputs of multiple iCVIs to produce a streaming label of either “over”, “under”, or “correctly” partitioned. Experiments were conducted on generalized synthetic and real-world data sets to demonstrate the efficacy and application of this method. Results of 100% accuracy were achieved in labeling partition quality on real-world data sets including MNIST and FLIR ADAS, demonstrating that the Meta-iCVI is a powerful and efficient tool for classifying partition quality in a variety of conditions. Its introduction should empower new and more efficient streaming clustering techniques. Additionally, we believe this to be the first implementation of an ensemble iCVI metric and the first time iCVI validation performance has been evaluated on randomized sample presentation.

Keywords