Contrastive Multi-View Learning for 3D Shape Clustering

被引:6
作者
Peng, Bo [1 ]
Lin, Guoting [1 ]
Lei, Jianjun [1 ]
Qin, Tianyi [1 ]
Cao, Xiaochun [2 ]
Ling, Nam [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[3] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA
基金
中国国家自然科学基金;
关键词
3D shape clustering; multi-view learning; contrastive learning; graph construction;
D O I
10.1109/TMM.2023.3347842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised 3D shape clustering is emerging as a promising research topic in multimedia and computer vision field. Considering the flexibility of acquiring multiple views for 3D shapes, this paper proposes a contrastive multi-view learning network (CMVL-Net) to cluster unlabeled 3D shapes from multiple views. To the best of our knowledge, this is the first multi-view-oriented 3D shape deep clustering method. The key to this method lies in how to capture highly discriminative 3D shape features suitable for clustering. By exploring consistency and complementarity among multiple views, a cross-view contrastive clustering mechanism is proposed to learn clustering-specified discriminative 3D shape features. To obtain a more compact 3D shape clustering structure, a consensus graph-guided contrastive constraint is designed to encourage cluster-wise consistency learning under the guidance of potential category associations among shapes. Experimental results on two widely used benchmark datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页码:6262 / 6272
页数:11
相关论文
共 55 条
  • [31] Graph-Based Static 3D Point Clouds Geometry Coding
    Rente, Paulo de Oliveira
    Brites, Catarina
    Ascenso, Joao
    Pereira, Fernando
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) : 284 - 299
  • [32] Sanghi Aditya, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12374), P626, DOI 10.1007/978-3-030-58526-6_37
  • [33] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [34] Variational Autoencoders for Deforming 3D Mesh Models
    Tan, Qingyang
    Gao, Lin
    Lai, Yu-Kun
    Xia, Shihong
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5841 - 5850
  • [35] Learning a Joint Affinity Graph for Multiview Subspace Clustering
    Tang, Chang
    Zhu, Xinzhong
    Liu, Xinwang
    Li, Miaomiao
    Wang, Pichao
    Zhang, Changqing
    Wang, Lizhe
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (07) : 1724 - 1736
  • [36] Reconsidering Representation Alignment for Multi-view Clustering
    Trosten, Daniel J.
    Lokse, Sigurd
    Jenssen, Robert
    Kampffmeyer, Michael
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1255 - 1265
  • [37] van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
  • [38] Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views
    Wei, Xin
    Gong, Yifei
    Wang, Fudong
    Sun, Xing
    Sun, Jian
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 397 - 406
  • [39] Wen C., 2023, IEEE Trans. Multimedia, DOI [10.1109/TMM.2023.3265177, DOI 10.1109/TMM.2023.3265177]
  • [40] Wu JJ, 2016, ADV NEUR IN, V29