Contrastive Multi-View Learning for 3D Shape Clustering

被引：8

作者：

Peng, Bo ^{[1
]}

Lin, Guoting ^{[1
]}

Lei, Jianjun ^{[1
]}

Qin, Tianyi ^{[1
]}

Cao, Xiaochun ^{[2
]}

Ling, Nam ^{[3
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China

[3] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

3D shape clustering; multi-view learning; contrastive learning; graph construction;

D O I：

10.1109/TMM.2023.3347842

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Unsupervised 3D shape clustering is emerging as a promising research topic in multimedia and computer vision field. Considering the flexibility of acquiring multiple views for 3D shapes, this paper proposes a contrastive multi-view learning network (CMVL-Net) to cluster unlabeled 3D shapes from multiple views. To the best of our knowledge, this is the first multi-view-oriented 3D shape deep clustering method. The key to this method lies in how to capture highly discriminative 3D shape features suitable for clustering. By exploring consistency and complementarity among multiple views, a cross-view contrastive clustering mechanism is proposed to learn clustering-specified discriminative 3D shape features. To obtain a more compact 3D shape clustering structure, a consensus graph-guided contrastive constraint is designed to encourage cluster-wise consistency learning under the guidance of potential category associations among shapes. Experimental results on two widely used benchmark datasets demonstrate the effectiveness of the proposed method.

引用

页码：6262 / 6272

页数：11

共 55 条

[1] Deep Multimodal Subspace Clustering Networks [J].

Abavisani, Mahdi ;

Patel, Vishal M. .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (06) :1601-1614

[2]

Achlioptas P, 2018, PR MACH LEARN RES, V80

[3] Constrained Multi-View Video Face Clustering [J].

Cao, Xiaochun ;

Zhang, Changqing ;

Zhou, Chengju ;

Fu, Huazhu ;

Foroosh, Hassan .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) :4381-4393

[4] Deep Clustering for Unsupervised Learning of Visual Features [J].

Caron, Mathilde ;

Bojanowski, Piotr ;

Joulin, Armand ;

Douze, Matthijs .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156

[5]

Chang JL, 2017, IEEE I CONF COMP VIS, P5880, DOI [10.1109/ICCV.2017.626, 10.1109/ICCV.2017.627]

[6]

Chen T, 2020, PR MACH LEARN RES, V119

[7] Learning Implicit Fields for Generative Shape Modeling [J].

Chen, Zhiqin ;

Zhang, Hao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941

[8] NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].

COVER, TM ;

HART, PE .

IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+

[9] Clustering-Driven Deep Embedding With Pairwise Constraints [J].

Fogel, Sharon ;

Averbuch-Elor, Hadar ;

Goldberger, Jacob ;

Cohen-Or, Daniel .

IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2019, 39 (04) :16-27

[10] MVTN: Multi-View Transformation Network for 3D Shape Recognition [J].

Hamdi, Abdullah ;

Giancola, Silvio ;

Ghanem, Bernard .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :1-11

← 1 2 3 4 5 6 →