Strongly augmented contrastive clustering

被引:49
作者
Deng, Xiaozhi [1 ]
Huang, Dong [1 ,2 ]
Chen, Ding-Hua [1 ]
Wang, Chang-Dong [3 ,4 ]
Lai, Jian-Huang [3 ,4 ]
机构
[1] South China Agr Univ, Coll Math & Informat, Guangzhou, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Smart Agr Technol Trop South China, Beijing, Peoples R China
[3] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[4] Guangdong Key Lab Informat Secur Technol, Guangdong, Peoples R China
关键词
Data clustering; Deep clustering; Image clustering; Contrastive learning; Deep neural network;
D O I
10.1016/j.patcog.2023.109470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep clustering has attracted increasing attention in recent years due to its capability of joint representation learning and clustering via deep neural networks. In its latest developments, the contrastive learning has emerged as an effective technique to substantially enhance the deep clustering performance. However, the existing contrastive learning based deep clustering algorithms mostly focus on some carefullydesigned augmentations (often with limited transformations to preserve the structure), referred to as weak augmentations, but cannot go beyond the weak augmentations to explore the more opportunities in stronger augmentations (with more aggressive transformations or even severe distortions). In this paper, we present an end-to-end deep clustering approach termed S trongly A ugmented C ontrastive C lustering (SACC), which extends the conventional two-augmentation-view paradigm to multiple views and jointly leverages strong and weak augmentations for strengthened deep clustering. Particularly, we utilize a backbone network with triply-shared weights, where a strongly augmented view and two weakly augmented views are incorporated. Based on the representations produced by the backbone, the weak-weak view pair and the strong-weak view pairs are simultaneously exploited for the instance-level contrastive learning (via an instance projector) and the cluster-level contrastive learning (via a cluster projector), which, together with the backbone, can be jointly optimized in a purely unsupervised manner. Experimental results on five challenging image datasets have shown the superiority of our SACC approach over the state-of-the-art. The code is available at https://github.com/dengxiaozhi/SACC .(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 49 条
[1]  
Bengio Yoshua., 2007, ADV NEURAL INFORM PR, V19, P153
[2]  
Cai D, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P1010
[3]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[4]  
Chang JL, 2019, Arxiv, DOI arXiv:1905.01681
[5]  
Chang JL, 2017, IEEE I CONF COMP VIS, P5880, DOI [10.1109/ICCV.2017.627, 10.1109/ICCV.2017.626]
[6]   D-TRACE: Deep Triply-Aligned Clustering [J].
Chen, Ding-Hua ;
Huang, Dong ;
Cheng, Haiyan ;
Wang, Chang-Dong .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 :526-537
[7]  
Chen T, 2020, PR MACH LEARN RES, V119
[8]  
Chen X, 2016, ADV NEUR IN, V29
[9]  
Coates A., 2011, AISTATS
[10]   Nearest Neighbor Matching for Deep Clustering [J].
Dang, Zhiyuan ;
Deng, Cheng ;
Yang, Xu ;
Wei, Kun ;
Huang, Heng .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13688-13697