Entropy (Feb 2011)

Information Theoretic Hierarchical Clustering

  • Babak Nadjar Araabi,
  • Mehdi Aghagolzadeh,
  • Hamid Soltanian-Zadeh

DOI
https://doi.org/10.3390/e13020450
Journal volume & issue
Vol. 13, no. 2
pp. 450 – 465

Abstract

Read online

Hierarchical clustering has been extensively used in practice, where clusters can be assigned and analyzed simultaneously, especially when estimating the number of clusters is challenging. However, due to the conventional proximity measures recruited in these algorithms, they are only capable of detecting mass-shape clusters and encounter problems in identifying complex data structures. Here, we introduce two bottom-up hierarchical approaches that exploit an information theoretic proximity measure to explore the nonlinear boundaries between clusters and extract data structures further than the second order statistics. Experimental results on both artificial and real datasets demonstrate the superiority of the proposed algorithm compared to conventional and information theoretic clustering algorithms reported in the literature, especially in detecting the true number of clusters.

Keywords