Robust augmentation-based contrastive clustering with negative data mining

被引：0

作者：

Sun, Liu ^{[1
]}

He, Ming ^{[1
]}

Qin, Shuai ^{[1
]}

Wang, Nianbin ^{[1
]}

机构：

[1] Harbin Engn Univ, Coll Comp Sci & Technol, 145 Nantong St, Harbin 150001, Heilongjiang, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025年 / 148卷

关键词：

Contrastive clustering; Representation learning; Unsupervised learning; Image clustering;

D O I：

10.1016/j.engappai.2025.110414

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Contrastive learning has been widely applied in unsupervised clustering tasks due to its superior performance in learning discriminative representations. However, the data augmentations commonly used in current contrastive clustering methods tend to combine weak and strong augmentations, which make it challenging to produce truly hard augmented examples due to the property of linear transformations. In addition, the simplicity of negative sample setting in contrastive learning generally leads to false positive samples, which aggravates the clustering performance. To alleviate these concerns, we propose a contrastive clustering method based on robust augmentation and negative data mining. Firstly, a robust augmentation method is introduced, which combines nonlinear augmentations with traditional linear augmentation methods to form contrastive pairs, to compensate for the limitations of linear augmentation in capturing complex features. Subsequently, we propose a hypothesis, i.e., if numerous samples of one class which widely separated from its cluster prototype are closest to the other cluster prototype, the two classes are considered to be the visual confusion categories (VCCs). Obviously, VCCs are particularly prone to boundary overlap in the clustering process. Therefore, we employ a confidence estimation criterion to obtain pseudo labels and increase the weight of negative samples/categories with VCCs labels, thereby enhancing the quality of clustering decision boundaries. Comprehensive experiments performed on four commonly used image benchmarks show that the proposed method is efficient.

引用

页数：12

共 50 条

[1] Multimodal fake news detection through data augmentation-based contrastive learning
Hua, Jiaheng
Cui, Xiaodong
Li, Xianghua
Tang, Keke
Zhu, Peican
APPLIED SOFT COMPUTING, 2023, 136
[2] INTransformer: Data augmentation-based contrastive learning by injecting noise into transformer for molecular property prediction
Jiang, Jing
Li, Yachao
Zhang, Ruisheng
Liu, Yunwu
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2024, 128
[3] Semantic Contrastive Clustering with Federated Data Augmentation
Wang, Qihong
Jia, Hongjie
Huang, Longxia
Mao, Qirong
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (06): : 1511 - 1524
[4] Data Augmentation-Based Photovoltaic Power Prediction
Wang, Xifeng
Shen, Yijun
Song, Haiyu
Liu, Shichao
ENERGIES, 2025, 18 (03)
[5] Knowledge augmentation-based soft constraints for semi-supervised clustering
Zhang, Zhanhu
Yu, Xia
Tao, Rui
Zhang, Xinyu
Li, Hongru
Lu, Jingyi
Zhou, Jian
APPLIED SOFT COMPUTING, 2023, 144
[6] Data augmentation-based statistical inference of diffusion processes
Wang, Yasen
Cheng, Cheng
Sun, Hongwei
Jin, Junyang
Fang, Huazhen
CHAOS, 2023, 33 (03)
[7] Data Augmentation-based Prognostics for Predictive Maintenance of Industrial System
Gay, Antonin
Voisin, Alexandre
Iung, Benoit
Do, Phuc
Bonidal, Remi
Khelassi, Ahmed
CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2022, 71 (01) : 409 - 412
[8] Data Augmentation-Based Joint Learning for Heterogeneous Face Recognition
Cao, Bing
Wang, Nannan
Li, Jie
Gao, Xinbo
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1731 - 1743
[9] Data Augmentation-Based Enhancement for Efficient Network Traffic Classification
Shin, Chang-Yui
Choi, Yang-Seo
Kim, Myung-Sup
IEEE ACCESS, 2025, 13 : 6006 - 6028
[10] Novelty Detection via Contrastive Learning with Negative Data Augmentation
Chen, Chengwei
Xie, Yuan
Lin, Shaohui
Qiao, Ruizhi
Zhou, Jian
Tan, Xin
Zhang, Yi
Ma, Lizhuang
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 606 - 614

← 1 2 3 4 5 →