IEEE Access (Jan 2019)

Fast Adaptive K-Means Subspace Clustering for High-Dimensional Data

  • Xiao-Dong Wang,
  • Rung-Ching Chen,
  • Fei Yan,
  • Zhi-Qiang Zeng,
  • Chao-Qun Hong

DOI
https://doi.org/10.1109/ACCESS.2019.2907043
Journal volume & issue
Vol. 7
pp. 42639 – 42651

Abstract

Read online

In many real-world applications, data are represented by high-dimensional features. Despite the simplicity, existing K-means subspace clustering algorithms often employ eigenvalue decomposition to generate an approximate solution, which makes the model less efficiency. Besides, their loss functions are either sensitive to outliers or small loss errors. In this paper, we propose a fast adaptive K-means (FAKM) type subspace clustering model, where an adaptive loss function is designed to provide a flexible cluster indicator calculation mechanism, thereby suitable for datasets under different distributions. To find the optimal feature subset, FAKM performs clustering and feature selection simultaneously without the eigenvalue decomposition, therefore efficient for real-world applications. We exploit an efficient alternative optimization algorithm to solve the proposed model, together with theoretical analyses on its convergence and computational complexity. Finally, extensive experiments on several benchmark datasets demonstrate the advantages of FAKM compared to state-of-the-art clustering algorithms.

Keywords