Robust augmentation-based contrastive clustering with negative data mining

被引:0
|
作者
Sun, Liu [1 ]
He, Ming [1 ]
Qin, Shuai [1 ]
Wang, Nianbin [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, 145 Nantong St, Harbin 150001, Heilongjiang, Peoples R China
关键词
Contrastive clustering; Representation learning; Unsupervised learning; Image clustering;
D O I
10.1016/j.engappai.2025.110414
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Contrastive learning has been widely applied in unsupervised clustering tasks due to its superior performance in learning discriminative representations. However, the data augmentations commonly used in current contrastive clustering methods tend to combine weak and strong augmentations, which make it challenging to produce truly hard augmented examples due to the property of linear transformations. In addition, the simplicity of negative sample setting in contrastive learning generally leads to false positive samples, which aggravates the clustering performance. To alleviate these concerns, we propose a contrastive clustering method based on robust augmentation and negative data mining. Firstly, a robust augmentation method is introduced, which combines nonlinear augmentations with traditional linear augmentation methods to form contrastive pairs, to compensate for the limitations of linear augmentation in capturing complex features. Subsequently, we propose a hypothesis, i.e., if numerous samples of one class which widely separated from its cluster prototype are closest to the other cluster prototype, the two classes are considered to be the visual confusion categories (VCCs). Obviously, VCCs are particularly prone to boundary overlap in the clustering process. Therefore, we employ a confidence estimation criterion to obtain pseudo labels and increase the weight of negative samples/categories with VCCs labels, thereby enhancing the quality of clustering decision boundaries. Comprehensive experiments performed on four commonly used image benchmarks show that the proposed method is efficient.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Multimodal fake news detection through data augmentation-based contrastive learning
    Hua, Jiaheng
    Cui, Xiaodong
    Li, Xianghua
    Tang, Keke
    Zhu, Peican
    APPLIED SOFT COMPUTING, 2023, 136
  • [2] INTransformer: Data augmentation-based contrastive learning by injecting noise into transformer for molecular property prediction
    Jiang, Jing
    Li, Yachao
    Zhang, Ruisheng
    Liu, Yunwu
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2024, 128
  • [3] Semantic Contrastive Clustering with Federated Data Augmentation
    Wang, Qihong
    Jia, Hongjie
    Huang, Longxia
    Mao, Qirong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (06): : 1511 - 1524
  • [4] Data Augmentation-Based Photovoltaic Power Prediction
    Wang, Xifeng
    Shen, Yijun
    Song, Haiyu
    Liu, Shichao
    ENERGIES, 2025, 18 (03)
  • [5] Knowledge augmentation-based soft constraints for semi-supervised clustering
    Zhang, Zhanhu
    Yu, Xia
    Tao, Rui
    Zhang, Xinyu
    Li, Hongru
    Lu, Jingyi
    Zhou, Jian
    APPLIED SOFT COMPUTING, 2023, 144
  • [6] Data augmentation-based statistical inference of diffusion processes
    Wang, Yasen
    Cheng, Cheng
    Sun, Hongwei
    Jin, Junyang
    Fang, Huazhen
    CHAOS, 2023, 33 (03)
  • [7] Data Augmentation-based Prognostics for Predictive Maintenance of Industrial System
    Gay, Antonin
    Voisin, Alexandre
    Iung, Benoit
    Do, Phuc
    Bonidal, Remi
    Khelassi, Ahmed
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2022, 71 (01) : 409 - 412
  • [8] Data Augmentation-Based Joint Learning for Heterogeneous Face Recognition
    Cao, Bing
    Wang, Nannan
    Li, Jie
    Gao, Xinbo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1731 - 1743
  • [9] Data Augmentation-Based Enhancement for Efficient Network Traffic Classification
    Shin, Chang-Yui
    Choi, Yang-Seo
    Kim, Myung-Sup
    IEEE ACCESS, 2025, 13 : 6006 - 6028
  • [10] Novelty Detection via Contrastive Learning with Negative Data Augmentation
    Chen, Chengwei
    Xie, Yuan
    Lin, Shaohui
    Qiao, Ruizhi
    Zhou, Jian
    Tan, Xin
    Zhang, Yi
    Ma, Lizhuang
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 606 - 614