Adaptive Zone Learning for Weakly Supervised Object Localization

Times cited: 0
Authors
Chen, Zhiwei [1]
Wang, Siwei [1]
Cao, Liujuan [1]
Shen, Yunhang [2]
Ji, Rongrong [1]
Affiliations
[1] Xiamen Univ, Key Lab Multimedia Trusted Percept & Efficient Com, Minist Educ China, Xiamen 361005, Peoples R China
[2] Tencent Co Ltd, YouTu Lab, Shanghai 518064, Peoples R China
Keywords
Location awareness; Feature extraction; Learning systems; Cams; Task analysis; Annotations; Generators; Class activation maps (CAMs); foreground prediction maps (FPMs); object localization; weakly supervised learning; NETWORK;
DOI
10.1109/TNNLS.2024.3392948
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Weakly supervised object localization (WSOL) stands as a pivotal endeavor within the realm of computer vision, entailing the localization of objects using only image-level labels. Contemporary approaches in WSOL have leveraged foreground prediction maps (FPMs), yielding commendable outcomes. However, these existing FPM-based techniques are predominantly confined to rudimentary strategies of either augmenting the foreground or diminishing the background presence. We argue for the exploration and exploitation of the intricate interplay between the object's foreground and its background to achieve efficient object localization. In this manuscript, we introduce an innovative framework, termed adaptive zone learning (AZL), which operates on a coarse-to-fine basis to refine FPMs through a triad of adaptive zone mechanisms. First, an adversarial learning mechanism (ALM) is employed, orchestrating an interplay between the foreground and background regions. This mechanism accentuates coarse-grained object regions in a mutually adversarial manner. Subsequently, an oriented learning mechanism (OLM) is unveiled, which harnesses local insights from both foreground and background in a fine-grained manner. This mechanism is instrumental in delineating object regions with greater granularity, thereby generating better FPMs. Furthermore, we propose a reinforced learning mechanism (RLM) as the compensatory mechanism for the adversarial design, by which the undesirable foreground maps are refined again. Extensive experiments on the CUB-200-2011 and ILSVRC datasets demonstrate that AZL achieves significant and consistent performance improvements over other state-of-the-art WSOL methods.
Pages: 7211-7224
Page count: 14