Multi-Modal 3D Shape Clustering with Dual Contrastive Learning

Guoting Lin; Zexun Zheng; Lin Chen; Tianyi Qin; Jiahui Song

doi:10.3390/app12157384

Applied Sciences (Jul 2022)

Multi-Modal 3D Shape Clustering with Dual Contrastive Learning

Guoting Lin,
Zexun Zheng,
Lin Chen,
Tianyi Qin,
Jiahui Song

Affiliations

Guoting Lin: School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
Zexun Zheng: School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
Lin Chen: School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
Tianyi Qin: School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
Jiahui Song: School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

DOI: https://doi.org/10.3390/app12157384
Journal volume & issue: Vol. 12, no. 15
p. 7384

Abstract

Read online

3D shape clustering is developing into an important research subject with the wide applications of 3D shapes in computer vision and multimedia fields. Since 3D shapes generally take on various modalities, how to comprehensively exploit the multi-modal properties to boost clustering performance has become a key issue for the 3D shape clustering task. Taking into account the advantages of multiple views and point clouds, this paper proposes the first multi-modal 3D shape clustering method, named the dual contrastive learning network (DCL-Net), to discover the clustering partitions of unlabeled 3D shapes. First, by simultaneously performing cross-view contrastive learning within multi-view modality and cross-modal contrastive learning between the point cloud and multi-view modalities in the representation space, a representation-level dual contrastive learning module is developed, which aims to capture discriminative 3D shape features for clustering. Meanwhile, an assignment-level dual contrastive learning module is designed by further ensuring the consistency of clustering assignments within the multi-view modality, as well as between the point cloud and multi-view modalities, thus obtaining more compact clustering partitions. Experiments on two commonly used 3D shape benchmarks demonstrate the effectiveness of the proposed DCL-Net.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords