Journal of Applied Science and Engineering (Sep 2024)

Deep Mutual Information Decoupling based Unsupervised Image Clustering

  • Yanfeng Wang,
  • Jinfeng Wang,
  • Weirong Zhang

DOI
https://doi.org/10.6180/jase.202506_28(6).0014
Journal volume & issue
Vol. 28, no. 6
pp. 1321 – 1329

Abstract

Read online

Cross-view image clustering (CIC) showcases immense potential in recognizing image patterns due to the power to aggregate information between views without labels. However, most CIC ignore intricate coupling relationships between category and redundant features in aggregating complementary information of cross-view images, which may restrict performance in recognizing patterns of images. To this end, a cross-view mutual information decoupling based deep generative clustering approach is proposed for recognizing image patterns (DMID-UIC), which contains information maximization deep generative module and self-supervised posterior inference module. Specifically, the former maximizes the mutual information between data and semantics within the generative adversarial network to decouple category and redundant features hidden in cross-view images, which guarantees the separation of different semantic distributions in the data space. The latter models the distribution fitting between generated data and the prior semantic code as a classification task, via treating partitioning results of common representations between views as self-supervised labels. Meanwhile, to better optimize the model, an EM optimization strategy is designed to enhance the above two module learning in an iterative manner. Finally, comprehensive results verify the superiority and effectiveness of DMID-UIC. DMID-UIC improves ACC by 10.1% on the Caltech 101-7 dataset, compared to the second-best result.

Keywords