Dual-branch contrastive learning for weakly supervised object localizationDual-branch contrastive learning for weakly supervised object localizationZ. Guo et al.

被引：0

作者：

Zebin Guo ^{[1
]}

Dong Li ^{[2
]}

Zhengjun Du ^{[1
]}

Bingfeng Seng ^{[2
]}

机构：

[1] Qinghai University,School of Computer Technology and Application

[2] Intelligent Computing and Application Laboratory of Qinghai Province,undefined

来源：

Applied Intelligence | 2025年 / 55卷 / 7期

关键词：

Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;

D O I：

10.1007/s10489-025-06514-1

中图分类号：

学科分类号：

摘要：

The weakly supervised object localization task uses image-level labels to train object localization models. Traditional convolutional neural network (CNN)-based methods usually localize objects using a class activation map. However, the class activation map usually suffers from the problem of activating a small part of the object that is most discriminative. Meanwhile, the methods based on the Vision Transformer can capture long-range feature dependencies but tend to ignore local feature details. In this paper, we innovatively propose a dual-branch contrastive learning (DBC) method that consists of a Transformer and a CNN branch. The method can effectively separate the background and foreground of an image and fuse the features of Transformer and CNN through contrastive learning. Specifically, the method separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Then, the representations of the same image from different branches form positive pairs for contrastive learning. The background and foreground representations from the same branch form negative pairs. Finally, the DBC method forces the model to separate the background and foreground representations through negative contrastive loss and makes the model fuse the features of two branches through positive contrastive loss. Experiments on the ILSVRC benchmark show that the proposed method can achieve a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, which are better metrics than those of the state-of-the-art methods with the same parameter complexity.

引用

共 50 条

[41] Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection
Feng, Xiaoxu
Yao, Xiwen
Shen, Hui
Cheng, Gong
Xiao, Bin
Han, Junwei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11977 - 11992
[42] Enhanced Spatial Feature Learning for Weakly Supervised Object Detection
Wu, Zhihao
Wen, Jie
Xu, Yong
Yang, Jian
Li, Xuelong
Zhang, David
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 961 - 972
[43] Self Paced Deep Learning for Weakly Supervised Object Detection
Sangineto, Enver
Nabi, Moin
Culibrk, Dubravko
Sebe, Nicu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (03) : 712 - 725
[44] Active Learning Strategies for Weakly-Supervised Object Detection
Vo, Huy V.
Simeoni, Oriane
Gidaris, Spyros
Bursuc, Andrei
Perez, Patrick
Ponce, Jean
COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 211 - 230
[45] Learning dynamic background for weakly supervised moving object detection
Zhang, Zhijun
Chang, Yi
Zhong, Sheng
Yan, Luxin
Zou, Xu
IMAGE AND VISION COMPUTING, 2022, 121
[46] An Improved Adaptive Angle Weakly Supervised Learning Object Detection
Chen, Yantong
Shi, Yuxin
Ren, Jianzhao
Li, Jiabao
2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 494 - 503
[47] Deep patch learning for weakly supervised object classification and discovery
Tang, Peng
Wang, Xinggang
Huang, Zilong
Bai, Xiang
Liu, Wenyu
PATTERN RECOGNITION, 2017, 71 : 446 - 459
[48] Weakly-Supervised Domain Adaptive Semantic Segmentation with Prototypical Contrastive Learning
Das, Anurag
Xian, Yongqin
Dai, Dengxin
Schiele, Bernt
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15434 - 15443
[49] Two-Phase Learning for Weakly Supervised Object Localization
Kim, Dahun
Cho, Donghyeon
Yoo, Donggeun
Kweon, In So
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3554 - 3563
[50] ALWOD: Active Learning for Weakly-Supervised Object Detection
Wang, Yuting
Ilic, Velibor
Li, Jiatong
Kisacanin, Branislav
Pavlovic, Vladimir
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6436 - 6446

← 1 2 3 4 5 →