Труды Института системного программирования РАН (Oct 2018)

Comparison of partial orders clustering techniques

  • A. Raskin

DOI
https://doi.org/10.15514/ISPRAS-2014-26(4)-7
Journal volume & issue
Vol. 26, no. 4
pp. 91 – 98

Abstract

Read online

In this paper, we compare three approaches of clustering partial ordered subsets of a set of items. First approach was k-medoids clustering algorithm with distance function based on Levenshtein distance. The second approach was k-means algorithm with cosine distance as distance function after vectorization of partial orders. And the third one was k-medoids algorithm with Kendall's tau as a distance function. We use Adjusted Rand Index as a measure of quality of clustering and find out that clustering with all three methods get stable results when variance of number of items ranked is high. Vectorization of partial orders get best results if number of items ranked is low.

Keywords