Contrastive Multi-View Learning for 3D Shape Clustering

被引:6
作者
Peng, Bo [1 ]
Lin, Guoting [1 ]
Lei, Jianjun [1 ]
Qin, Tianyi [1 ]
Cao, Xiaochun [2 ]
Ling, Nam [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[3] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA
基金
中国国家自然科学基金;
关键词
3D shape clustering; multi-view learning; contrastive learning; graph construction;
D O I
10.1109/TMM.2023.3347842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised 3D shape clustering is emerging as a promising research topic in multimedia and computer vision field. Considering the flexibility of acquiring multiple views for 3D shapes, this paper proposes a contrastive multi-view learning network (CMVL-Net) to cluster unlabeled 3D shapes from multiple views. To the best of our knowledge, this is the first multi-view-oriented 3D shape deep clustering method. The key to this method lies in how to capture highly discriminative 3D shape features suitable for clustering. By exploring consistency and complementarity among multiple views, a cross-view contrastive clustering mechanism is proposed to learn clustering-specified discriminative 3D shape features. To obtain a more compact 3D shape clustering structure, a consensus graph-guided contrastive constraint is designed to encourage cluster-wise consistency learning under the guidance of potential category associations among shapes. Experimental results on two widely used benchmark datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页码:6262 / 6272
页数:11
相关论文
共 55 条
  • [1] Deep Multimodal Subspace Clustering Networks
    Abavisani, Mahdi
    Patel, Vishal M.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (06) : 1601 - 1614
  • [2] Achlioptas P, 2018, PR MACH LEARN RES, V80
  • [3] Constrained Multi-View Video Face Clustering
    Cao, Xiaochun
    Zhang, Changqing
    Zhou, Chengju
    Fu, Huazhu
    Foroosh, Hassan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4381 - 4393
  • [4] Deep Clustering for Unsupervised Learning of Visual Features
    Caron, Mathilde
    Bojanowski, Piotr
    Joulin, Armand
    Douze, Matthijs
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 139 - 156
  • [5] Chang JL, 2017, IEEE I CONF COMP VIS, P5880, DOI [10.1109/ICCV.2017.626, 10.1109/ICCV.2017.627]
  • [6] Chen T, 2020, PR MACH LEARN RES, V119
  • [7] Learning Implicit Fields for Generative Shape Modeling
    Chen, Zhiqin
    Zhang, Hao
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5932 - 5941
  • [8] NEAREST NEIGHBOR PATTERN CLASSIFICATION
    COVER, TM
    HART, PE
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) : 21 - +
  • [9] Clustering-Driven Deep Embedding With Pairwise Constraints
    Fogel, Sharon
    Averbuch-Elor, Hadar
    Goldberger, Jacob
    Cohen-Or, Daniel
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2019, 39 (04) : 16 - 27
  • [10] MVTN: Multi-View Transformation Network for 3D Shape Recognition
    Hamdi, Abdullah
    Giancola, Silvio
    Ghanem, Bernard
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1 - 11