Contrastive Multi-View Learning for 3D Shape Clustering

被引:8
作者
Peng, Bo [1 ]
Lin, Guoting [1 ]
Lei, Jianjun [1 ]
Qin, Tianyi [1 ]
Cao, Xiaochun [2 ]
Ling, Nam [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[3] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA
基金
中国国家自然科学基金;
关键词
3D shape clustering; multi-view learning; contrastive learning; graph construction;
D O I
10.1109/TMM.2023.3347842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised 3D shape clustering is emerging as a promising research topic in multimedia and computer vision field. Considering the flexibility of acquiring multiple views for 3D shapes, this paper proposes a contrastive multi-view learning network (CMVL-Net) to cluster unlabeled 3D shapes from multiple views. To the best of our knowledge, this is the first multi-view-oriented 3D shape deep clustering method. The key to this method lies in how to capture highly discriminative 3D shape features suitable for clustering. By exploring consistency and complementarity among multiple views, a cross-view contrastive clustering mechanism is proposed to learn clustering-specified discriminative 3D shape features. To obtain a more compact 3D shape clustering structure, a consensus graph-guided contrastive constraint is designed to encourage cluster-wise consistency learning under the guidance of potential category associations among shapes. Experimental results on two widely used benchmark datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页码:6262 / 6272
页数:11
相关论文
共 55 条
[31]   Graph-Based Static 3D Point Clouds Geometry Coding [J].
Rente, Paulo de Oliveira ;
Brites, Catarina ;
Ascenso, Joao ;
Pereira, Fernando .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) :284-299
[32]   Info3D: Representation Learning on 3D Objects Using Mutual Information Maximization and Contrastive Learning [J].
Sanghi, Aditya .
COMPUTER VISION - ECCV 2020, PT XXIX, 2020, 12374 :626-642
[33]   Multi-view Convolutional Neural Networks for 3D Shape Recognition [J].
Su, Hang ;
Maji, Subhransu ;
Kalogerakis, Evangelos ;
Learned-Miller, Erik .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :945-953
[34]   Variational Autoencoders for Deforming 3D Mesh Models [J].
Tan, Qingyang ;
Gao, Lin ;
Lai, Yu-Kun ;
Xia, Shihong .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5841-5850
[35]   Learning a Joint Affinity Graph for Multiview Subspace Clustering [J].
Tang, Chang ;
Zhu, Xinzhong ;
Liu, Xinwang ;
Li, Miaomiao ;
Wang, Pichao ;
Zhang, Changqing ;
Wang, Lizhe .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (07) :1724-1736
[36]   Contrastive Multiview Coding [J].
Tian, Yonglong ;
Krishnan, Dilip ;
Isola, Phillip .
COMPUTER VISION - ECCV 2020, PT XI, 2020, 12356 :776-794
[37]   Reconsidering Representation Alignment for Multi-view Clustering [J].
Trosten, Daniel J. ;
Lokse, Sigurd ;
Jenssen, Robert ;
Kampffmeyer, Michael .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1255-1265
[38]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[39]   Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views [J].
Wei, Xin ;
Gong, Yifei ;
Wang, Fudong ;
Sun, Xing ;
Sun, Jian .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :397-406
[40]   3D Shape Contrastive Representation Learning With Adversarial Examples [J].
Wen, Congcong ;
Li, Xiang ;
Huang, Hao ;
Liu, Yu-Shen ;
Fang, Yi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 :679-692