Hierarchical complementary learning for weakly supervised object localization

Cited by: 0
Authors
Benassou, Sabrina Narimene [1]
Shi, Wuzhen [2]
Jiang, Feng [1]
Benzine, Abdallah [3]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, 92 Xidazhi St, Harbin, Peoples R China
[2] Shenzhen Univ, Coll Elect & Informat Engn, 3688 Nanhai Ave, Shenzhen, Peoples R China
[3] Digeiz, AI Lab, 47 Rue Marcel Dassault, F-92100 Boulogne, France
Funding
U.S. National Science Foundation;
Keywords
Weakly supervised object localization; Class activation map; Complementary map; Fusion strategy;
DOI
10.1016/j.image.2021.116520
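Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract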
Weakly supervised object localization (WSOL) is a challenging problem that aims to localize objects without ground-truth bounding boxes. A common approach is to train a model that generates a class activation map (CAM) to localize the discriminative features of the object. The limitation of this approach is that it detects only a part of the object rather than the whole object. To solve this problem, previous works have removed some parts of the image (Zhang et al., 2018; Zhang et al., 2018; Singh and Lee, 2017; Choe and Shim, 2019) to force the model to detect the full object extent. However, these methods require one or more hyper-parameters to erase the appropriate pixels in the image, which can cause a loss of information. In this paper, we propose a Hierarchical Complementary Learning Network (HCLNet) that helps the CNN perform better on both classification and localization. HCLNet uses a complementary CAM to generate multiple maps that detect different parts of the object. Unlike previous works, this method needs no extra hyper-parameters and does not introduce a large loss of information. To fuse these maps, two fusion strategies are used: the addition strategy and the l1-norm strategy. These strategies make it possible to detect the whole object while excluding the background. Extensive experiments show that HCLNet obtains better performance than state-of-the-art methods.
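The abstract names the fusion strategies but does not define them, so the following is only a minimal NumPy sketch of how several activation maps might be combined. The CAM computation follows the standard weighted sum of feature maps. The function names and the per-pixel weighting used for the l1-norm variant are assumptions made for illustration, not the paper's formulation, and the input maps are assumed to come from the network.

```python
import numpy as np


def class_activation_map(features, class_weights):
    """Standard CAM: weighted sum of the last conv feature maps.

    features: array of shape (C, H, W); class_weights: array of shape (C,)
    holding the classifier weights of one class.
    """
    return np.tensordot(class_weights, features, axes=1)  # shape (H, W)


def normalize(m, eps=1e-8):
    """Rescale a map to roughly [0, 1]."""
    m = m - m.min()
    return m / (m.max() + eps)


def fuse_addition(maps):
    """Addition strategy (assumed form): element-wise sum of the maps."""
    return normalize(np.sum(maps, axis=0))


def fuse_l1(maps, eps=1e-8):
    """l1-norm strategy (assumed form, hypothetical reading).

    Each map is weighted per pixel by its share of the total absolute
    activation at that pixel, then the weighted maps are summed.
    """
    stack = np.stack(maps)                      # shape (K, H, W)
    activity = np.abs(stack)                    # per-pixel l1 activity
    weights = activity / (activity.sum(axis=0, keepdims=True) + eps)
    return normalize((weights * stack).sum(axis=0))
```

Under these assumptions, two maps that respond to different object parts yield a fused map covering both parts, and thresholding that map would give the localization region.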
Pages: 7
Related papers
  • [1] Diverse Complementary Part Mining for Weakly Supervised Object Localization
    Meng, Meng
    Zhang, Tianzhu
    Yang, Wenfei
    Zhao, Jian
    Zhang, Yongdong
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1774 - 1788
  • [2] Weakly Supervised Learning for Object Localization Based on an Attention Mechanism
    Park, Nojin
    Ko, Hanseok
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [3] Dual-branch contrastive learning for weakly supervised object localization
    Guo, Zebin
    Li, Dong
    Du, Zhengjun
    Seng, Bingfeng
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [4] Weakly supervised foreground learning for weakly supervised localization and detection
    Zhang, Chen-Lin
    Li, Yin
    Wu, Jianxin
    PATTERN RECOGNITION, 2023, 137
  • [5] Adversarial Transformers for Weakly Supervised Object Localization
    Meng, Meng
    Zhang, Tianzhu
    Zhang, Zhe
    Zhang, Yongdong
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7130 - 7143
  • [6] SALIENCY AWARE: WEAKLY SUPERVISED OBJECT LOCALIZATION
    Chen, Yun-Chun
    Hsu, Winston H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1907 - 1911
  • [7] IMPROVING CLASS ACTIVATION MAP FOR WEAKLY SUPERVISED OBJECT LOCALIZATION
    Zhang, Zhenfei
    Chang, Ming-Ching
    Bui, Tien D.
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2624 - 2628
  • [8] Entropy guided adversarial model for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    NEUROCOMPUTING, 2021, 429 : 60 - 68
  • [9] Weakly-Supervised Object Localization by Cutting Background with Deep Reinforcement Learning
    Zheng, Wu
    Zhang, Zhaoxiang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 210 - 218
  • [10] Token Masking Transformer for Weakly Supervised Object Localization
    Xu, Wenhao
    Wang, Changwei
    Xu, Rongtao
    Xu, Shibiao
    Meng, Weiliang
    Zhang, Man
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2059 - 2069