E3S Web of Conferences (Jan 2021)

Features Clustering Around Latent Variables for High Dimensional Data

  • Ghizlane Ez-Zarrad,
  • Wafae Sabbar,
  • Abdelkrim Bekkhoucha

DOI
https://doi.org/10.1051/e3sconf/202129701070
Journal volume & issue
Vol. 297
p. 01070

Abstract

Read online

Clustering of variables is the task of grouping similar variables into different groups. It may be useful in several situations such as dimensionality reduction, feature selection, and detect redundancies. In the present study, we combine two methods of features clustering the clustering of variables around latent variables (CLV) algorithm and the k-means based co-clustering algorithm (kCC). Indeed, classical CLV cannot be applied to high dimensional data because this approach becomes tedious when the number of features increases.