Dual-branch contrastive learning for weakly supervised object localizationDual-branch contrastive learning for weakly supervised object localizationZ. Guo et al.

被引:0
|
作者
Zebin Guo [1 ]
Dong Li [2 ]
Zhengjun Du [1 ]
Bingfeng Seng [2 ]
机构
[1] Qinghai University,School of Computer Technology and Application
[2] Intelligent Computing and Application Laboratory of Qinghai Province,undefined
关键词
Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;
D O I
10.1007/s10489-025-06514-1
中图分类号
学科分类号
摘要
The weakly supervised object localization task uses image-level labels to train object localization models. Traditional convolutional neural network (CNN)-based methods usually localize objects using a class activation map. However, the class activation map usually suffers from the problem of activating a small part of the object that is most discriminative. Meanwhile, the methods based on the Vision Transformer can capture long-range feature dependencies but tend to ignore local feature details. In this paper, we innovatively propose a dual-branch contrastive learning (DBC) method that consists of a Transformer and a CNN branch. The method can effectively separate the background and foreground of an image and fuse the features of Transformer and CNN through contrastive learning. Specifically, the method separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Then, the representations of the same image from different branches form positive pairs for contrastive learning. The background and foreground representations from the same branch form negative pairs. Finally, the DBC method forces the model to separate the background and foreground representations through negative contrastive loss and makes the model fuse the features of two branches through positive contrastive loss. Experiments on the ILSVRC benchmark show that the proposed method can achieve a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, which are better metrics than those of the state-of-the-art methods with the same parameter complexity.
引用
收藏
相关论文
共 50 条
  • [1] Dual-branch contrastive learning for weakly supervised object localization
    Guo, Zebin
    Li, Dong
    Du, Zhengjun
    Seng, Bingfeng
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [2] Object Discovery via Contrastive Learning for Weakly Supervised Object Detection
    Seo, Jinhwan
    Bae, Wonho
    Sutherland, Danica J.
    Noh, Junhyug
    Kim, Daijin
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 312 - 329
  • [3] Weakly-Supervised Contrastive Learning for Unsupervised Object Discovery
    Lv, Yunqiu
    Zhang, Jing
    Barnes, Nick
    Dai, Yuchao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2689 - 2702
  • [4] Weakly Supervised Contrastive Learning
    Zheng, Mingkai
    Wang, Fei
    You, Shan
    Qian, Chen
    Zhang, Changshui
    Wang, Xiaogang
    Xu, Chang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10022 - 10031
  • [5] Negative Prototypes Guided Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Yu
    Zhu, Chuang
    Yang, Guoqing
    Chen, Siqi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 36 - 51
  • [6] Instance-Level Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Ming
    Zeng, Bing
    SENSORS, 2022, 22 (19)
  • [7] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
    Ki, Minsong
    Uh, Youngjung
    Lee, Wonyoung
    Byun, Hyeran
    NEUROCOMPUTING, 2021, 445 : 244 - 254
  • [8] Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection
    Deng, Yuang
    Zhang, Yuhang
    Dai, Wenrui
    Zhang, Xiaopeng
    Xiong, Hongkai
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [9] Towards Precise Weakly Supervised Object Detection via Interactive Contrastive Learning of Context Information
    Lai, Qi
    Vong, Chi-Man
    Shi, Sai-Qi
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [10] Weakly Supervised Contrastive Learning for Unsupervised Vehicle Reidentification
    Yu, Jongmin
    Oh, Hyeontaek
    Kim, Minkyung
    Kim, Junsik
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15543 - 15553