Semantic Contrastive Clustering with Federated Data Augmentation

被引:0
作者
Wang, Qihong [1 ]
Jia, Hongjie [1 ]
Huang, Longxia [1 ]
Mao, Qirong [1 ]
机构
[1] School of Computer Science and Communication Engineering, Jiangsu University, Jiangsu, Zhenjiang
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2024年 / 61卷 / 06期
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
clustering; contrastive learning; global category information; strong data augmentation; weak data augmentation;
D O I
10.7544/issn1000-1239.202220995
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Given the excellent performance of contrastive learning on downstream tasks, contrastive clustering has received much more attention recently. However, most approaches only utilize a simple kind of data augmentation. Although augmented views keep the majority of information from original samples, they also inherit a mixture of characteristic of features, including semantic and non-semantic features, which limits model’s learning ability of semantic information under similar or identical view patterns. Even some approaches regard two different augmentation views being from the same sample and keeping similar view patterns as positive pairs, which results in sample pairs lacking of semantics. In this paper, we propose a semantic contrastive clustering method with federated data augmentation to solve these problems. Two different types of data augmentations, namely strong data augmentation and weak data augmentation, are introduced to produce two very different view patterns. These two view patterns are utilized to mitigate the disturbance of non-semantic information and improve the semantic awareness of the proposed approach. Moreover, a global k-nearest neighbor graph is used to bring global category information, which instructs the model to treat different samples from the same cluster as positive pairs. Extensive experiments on six commonly used and challenging image datasets show that the proposed method achieves the state-of-the-art performance and confirms the superiority and validity of it. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1511 / 1524
页数:13
相关论文
共 50 条
  • [21] Federated Fuzzy Clustering for Longitudinal Health Data
    Balkus, Salvador V.
    Fang, Hua
    Wang, Honggang
    2022 IEEE/ACM CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE 2022), 2022, : 128 - 132
  • [22] Attributed graph clustering under the contrastive mechanism with cluster-preserving augmentation
    Zheng, Yimei
    Jia, Caiyan
    Yu, Jian
    INFORMATION SCIENCES, 2024, 681
  • [23] Prefix Data Augmentation for Contrastive Learning of Unsupervised Sentence Embedding
    Wang, Chunchun
    Lv, Shu
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [24] Contrastive Learning vs. Self-Learning vs. Deformable Data Augmentation in Semantic Segmentation of Medical Images
    Arabi, Hossein
    Zaidi, Habib
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (06): : 3217 - 3230
  • [25] Contrastive self-representation learning for data clustering
    Zhao, Wenhui
    Gao, Quanxue
    Mei, Shikun
    Yang, Ming
    NEURAL NETWORKS, 2023, 167 : 648 - 655
  • [26] CbDA: Contrastive-Based Data Augmentation for Domain Generalization
    Jiang, Ziyi
    Zhang, Liwen
    Liang, Xiaoxuan
    Chen, Zhenghan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [27] Unsupervised Contrastive Learning for Time Series Data Clustering
    Cao, Bo
    Xing, Qinghua
    Yang, Ke
    Wu, Xuan
    Li, Longyue
    ELECTRONICS, 2025, 14 (08):
  • [28] FedProc: Prototypical contrastive federated learning on non-IID data
    Mu, Xutong
    Shen, Yulong
    Cheng, Ke
    Geng, Xueli
    Fu, Jiaxuan
    Zhang, Tao
    Zhang, Zhiwei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 93 - 104
  • [29] Semantic Feature Graph Consistency with Contrastive Cluster Assignments for Multilingual Document Clustering
    Sun, Teng
    Shu, Zhenqiu
    Huang, Yuxin
    Wang, Hongbin
    Yu, Zhengtao
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2025, 24 (01)
  • [30] SUBSPACE CLUSTERING USING UNSUPERVISED DATA AUGMENTATION
    Abdolali, Maryam
    Gillis, Nicolas
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3868 - 3872