Algorithms (Mar 2021)

Towards Understanding Clustering Problems and Algorithms: An Instance Space Analysis

  • Luiz Henrique dos Santos Fernandes
  • Ana Carolina Lorena
  • Kate Smith-Miles

DOI: https://doi.org/10.3390/a14030095
Journal volume & issue: Vol. 14, no. 3, p. 95

Abstract

Various criteria and algorithms can be used for clustering, leading to very distinct outcomes and potential biases towards datasets with certain structures. More generally, selecting the most effective algorithm for a given dataset based on its characteristics is a problem that has been widely studied in the field of meta-learning. Recent advances in the form of a new methodology known as Instance Space Analysis provide an opportunity to extend such meta-analyses and gain greater visual insight into the relationship between datasets’ characteristics and the performance of different algorithms. The aim of this study is to perform, for the first time, an Instance Space Analysis for clustering problems and algorithms. As a result, we are able to analyze the impact of the choice of test instances, as well as the strengths and weaknesses of some popular clustering algorithms on datasets with different structures.

Keywords