Dual-branch contrastive learning for weakly supervised object localizationDual-branch contrastive learning for weakly supervised object localizationZ. Guo et al.

被引:0
|
作者
Zebin Guo [1 ]
Dong Li [2 ]
Zhengjun Du [1 ]
Bingfeng Seng [2 ]
机构
[1] Qinghai University,School of Computer Technology and Application
[2] Intelligent Computing and Application Laboratory of Qinghai Province,undefined
关键词
Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;
D O I
10.1007/s10489-025-06514-1
中图分类号
学科分类号
摘要
The weakly supervised object localization task uses image-level labels to train object localization models. Traditional convolutional neural network (CNN)-based methods usually localize objects using a class activation map. However, the class activation map usually suffers from the problem of activating a small part of the object that is most discriminative. Meanwhile, the methods based on the Vision Transformer can capture long-range feature dependencies but tend to ignore local feature details. In this paper, we innovatively propose a dual-branch contrastive learning (DBC) method that consists of a Transformer and a CNN branch. The method can effectively separate the background and foreground of an image and fuse the features of Transformer and CNN through contrastive learning. Specifically, the method separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Then, the representations of the same image from different branches form positive pairs for contrastive learning. The background and foreground representations from the same branch form negative pairs. Finally, the DBC method forces the model to separate the background and foreground representations through negative contrastive loss and makes the model fuse the features of two branches through positive contrastive loss. Experiments on the ILSVRC benchmark show that the proposed method can achieve a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, which are better metrics than those of the state-of-the-art methods with the same parameter complexity.
引用
收藏
相关论文
共 50 条
  • [41] Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection
    Feng, Xiaoxu
    Yao, Xiwen
    Shen, Hui
    Cheng, Gong
    Xiao, Bin
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11977 - 11992
  • [42] Enhanced Spatial Feature Learning for Weakly Supervised Object Detection
    Wu, Zhihao
    Wen, Jie
    Xu, Yong
    Yang, Jian
    Li, Xuelong
    Zhang, David
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 961 - 972
  • [43] Self Paced Deep Learning for Weakly Supervised Object Detection
    Sangineto, Enver
    Nabi, Moin
    Culibrk, Dubravko
    Sebe, Nicu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (03) : 712 - 725
  • [44] Active Learning Strategies for Weakly-Supervised Object Detection
    Vo, Huy V.
    Simeoni, Oriane
    Gidaris, Spyros
    Bursuc, Andrei
    Perez, Patrick
    Ponce, Jean
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 211 - 230
  • [45] Learning dynamic background for weakly supervised moving object detection
    Zhang, Zhijun
    Chang, Yi
    Zhong, Sheng
    Yan, Luxin
    Zou, Xu
    IMAGE AND VISION COMPUTING, 2022, 121
  • [46] An Improved Adaptive Angle Weakly Supervised Learning Object Detection
    Chen, Yantong
    Shi, Yuxin
    Ren, Jianzhao
    Li, Jiabao
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 494 - 503
  • [47] Deep patch learning for weakly supervised object classification and discovery
    Tang, Peng
    Wang, Xinggang
    Huang, Zilong
    Bai, Xiang
    Liu, Wenyu
    PATTERN RECOGNITION, 2017, 71 : 446 - 459
  • [48] Weakly-Supervised Domain Adaptive Semantic Segmentation with Prototypical Contrastive Learning
    Das, Anurag
    Xian, Yongqin
    Dai, Dengxin
    Schiele, Bernt
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15434 - 15443
  • [49] Two-Phase Learning for Weakly Supervised Object Localization
    Kim, Dahun
    Cho, Donghyeon
    Yoo, Donggeun
    Kweon, In So
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3554 - 3563
  • [50] ALWOD: Active Learning for Weakly-Supervised Object Detection
    Wang, Yuting
    Ilic, Velibor
    Li, Jiatong
    Kisacanin, Branislav
    Pavlovic, Vladimir
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6436 - 6446