Twin Contrastive Learning for Online Clustering

被引:92
作者
Li, Yunfan [1 ]
Yang, Mouxing [1 ]
Peng, Dezhong [1 ]
Li, Taihao [2 ]
Huang, Jiantao [2 ]
Peng, Xi [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
Deep clustering; Online clustering; Unsupervised learning; Contrastive learning;
D O I
10.1007/s11263-022-01639-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes to perform online clustering by conducting twin contrastive learning (TCL) at the instance and cluster level. Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively. Based on the observation, for a given dataset, the proposed TCL first constructs positive and negative pairs through data augmentations. Thereafter, in the row and column space of the feature matrix, instance- and cluster-level contrastive learning are respectively conducted by pulling together positive pairs while pushing apart the negatives. To alleviate the influence of intrinsic false-negative pairs and rectify cluster assignments, we adopt a confidence-based criterion to select pseudo-labels for boosting both the instance- and cluster-level contrastive learning. As a result, the clustering performance is further improved. Besides the elegant idea of twin contrastive learning, another advantage of TCL is that it could independently predict the cluster assignment for each instance, thus effortlessly fitting online scenarios. Extensive experiments on six widely-used image and text benchmarks demonstrate the effectiveness of TCL. The code is released on.
引用
收藏
页码:2205 / 2221
页数:17
相关论文
共 86 条
[31]  
Hu WH, 2017, PR MACH LEARN RES, V70
[32]   The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation [J].
Huang, Junjie ;
Zhu, Zheng ;
Guo, Feng ;
Huang, Guan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5699-5708
[33]   COMPARING PARTITIONS [J].
HUBERT, L ;
ARABIE, P .
JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) :193-218
[34]   Invariant Information Clustering for Unsupervised Image Classification and Segmentation [J].
Ji, Xu ;
Henriques, Joao F. ;
Vedaldi, Andrea .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9864-9873
[35]  
Khosla P., 2020, ADV NEURAL INFORM PR, V33, P18661, DOI 10.48550/arXiv.2004.11362
[36]   NLNL: Negative Learning for Noisy Labels [J].
Kim, Youngdong ;
Yim, Junho ;
Yun, Juseung ;
Kim, Junmo .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :101-110
[37]  
Kingma D. P., 2013, Auto-encoding variational bayes
[38]  
Kiros R., 2015, Advances in Neural Information Processing Systems, V28
[39]   Challenges in unsupervised clustering of single-cell RNA-seq data [J].
Kiselev, Vladimir Yu ;
Andrews, Tallulah S. ;
Hemberg, Martin .
NATURE REVIEWS GENETICS, 2019, 20 (05) :273-282
[40]  
Krizhevsky A., 2012, Handbook Syst. Autoimmune Dis