Dual-branch contrastive learning for weakly supervised object localizationDual-branch contrastive learning for weakly supervised object localizationZ. Guo et al.

被引:0
|
作者
Zebin Guo [1 ]
Dong Li [2 ]
Zhengjun Du [1 ]
Bingfeng Seng [2 ]
机构
[1] Qinghai University,School of Computer Technology and Application
[2] Intelligent Computing and Application Laboratory of Qinghai Province,undefined
关键词
Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;
D O I
10.1007/s10489-025-06514-1
中图分类号
学科分类号
摘要
The weakly supervised object localization task uses image-level labels to train object localization models. Traditional convolutional neural network (CNN)-based methods usually localize objects using a class activation map. However, the class activation map usually suffers from the problem of activating a small part of the object that is most discriminative. Meanwhile, the methods based on the Vision Transformer can capture long-range feature dependencies but tend to ignore local feature details. In this paper, we innovatively propose a dual-branch contrastive learning (DBC) method that consists of a Transformer and a CNN branch. The method can effectively separate the background and foreground of an image and fuse the features of Transformer and CNN through contrastive learning. Specifically, the method separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Then, the representations of the same image from different branches form positive pairs for contrastive learning. The background and foreground representations from the same branch form negative pairs. Finally, the DBC method forces the model to separate the background and foreground representations through negative contrastive loss and makes the model fuse the features of two branches through positive contrastive loss. Experiments on the ILSVRC benchmark show that the proposed method can achieve a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, which are better metrics than those of the state-of-the-art methods with the same parameter complexity.
引用
收藏
相关论文
共 50 条
  • [21] Consistent prototype contrastive learning for weakly supervised person search
    Lin, Huadong
    Yu, Xiaohan
    Zhang, Pengcheng
    Bai, Xiao
    Zheng, Jin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 105
  • [22] Weakly-supervised Temporal Path Representation Learning with Contrastive Curriculum Learning
    Yang, Sean Bin
    Guo, Chenjuan
    Hu, Jilin
    Yang, Bin
    Tang, Jian
    Jensen, Christian S.
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2873 - 2885
  • [23] Dense Supervised Dual-Aware Contrastive Learning for Airborne Laser Scanning Weakly Supervised Semantic Segmentation
    Luo, Ziwei
    Zeng, Tao
    Jiang, Xinyi
    Peng, Qingyu
    Ma, Ying
    Xie, Zhong
    Pan, Xiong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [24] Weakly Supervised Object Detection Based on Active Learning
    Xiao Wang
    Xiang Xiang
    Baochang Zhang
    Xuhui Liu
    Jianying Zheng
    QingLei Hu
    Neural Processing Letters, 2022, 54 : 5169 - 5183
  • [25] Weakly Supervised Object Localization with Latent Category Learning
    Wang, Chong
    Ren, Weiqiang
    Huang, Kaiqi
    Tan, Tieniu
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 431 - 445
  • [26] Hierarchical complementary learning for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    Benzine, Abdallah
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 100
  • [27] Feature disparity learning for weakly supervised object localization
    Li, Bingfeng
    Ruan, Haohao
    Li, Xinwei
    Wang, Keping
    IMAGE AND VISION COMPUTING, 2024, 145
  • [28] Transfer Learning by Ranking for Weakly Supervised Object Annotation
    Shi, Zhiyuan
    Siva, Parthipan
    Xiang, Tao
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [29] Strengthen Learning Tolerance for Weakly Supervised Object Localization
    Guo, Guangyu
    Han, Junwei
    Wan, Fang
    Zhang, Dingwen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7399 - 7408
  • [30] Weakly Supervised Object Detection Based on Active Learning
    Wang, Xiao
    Xiang, Xiang
    Zhang, Baochang
    Liu, Xuhui
    Zheng, Jianying
    Hu, Qinglei
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5169 - 5183