Region-based dropout with attention prior for weakly supervised object localization

被引:21
作者
Choe, Junsuk [1 ]
Han, Dongyoon [1 ]
Yun, Sangdoo [1 ]
Ha, Jung-Woo [1 ]
Oh, Seong Joon [1 ]
Shim, Hyunjung [2 ]
机构
[1] NAVER AI Lab, Meylan, France
[2] Yonsei Univ, Sch Integrated Technol, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Deep learning; Object localization; Weakly supervised learning; Region-based dropout; Attention prior;
D O I
10.1016/j.patcog.2021.107949
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) methods utilize the internal feature responses of a classifier trained only on image-level labels. Classifiers tend to focus on the most discriminative part of the target object, instead of considering its full extent. Adversarial erasing (AE) techniques have been proposed to ameliorate this problem. These techniques erase the most discriminative part during training, thereby encouraging the classifiers to learn the less discriminative parts of the object. Despite the success of AE-based methods, we have observed that the hyperparameters fail to generalize across model architectures and datasets. Therefore, new sets of hyperparameters must be determined for each architecture and dataset. The selection of hyperparameters frequently requires strong supervision (e.g., pixel-level annotations or human inspection). Because WSOL is premised on the assumption that such strong supervision is absent, the applicability of AE-based methods is limited. In this paper, we propose the region -based dropout with attention prior (RDAP) algorithm, which features hyperparameter transferability. We combined AE with regional dropout algorithms that provide greater stability against the selection of hyperparameters. We empirically confirmed that the RDAP method achieved state-of-the-art localization accuracy on four architectures, namely VGG-GAP, InceptionV3, ResNet-50 SE, and PreResNet-18, and two datasets, namely CUB-200-2011 and ImageNet-1k, with a single set of hyperparameters. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 39 条
[1]  
[Anonymous], 2015, 3 INT C LEARN REPR
[2]   Weakly Supervised Deep Detection Networks [J].
Bilen, Hakan ;
Vedaldi, Andrea .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2846-2854
[3]   Evaluating Weakly Supervised Object Localization Methods Right [J].
Choe, Junsuk ;
Oh, Seong Joon ;
Lee, Seungho ;
Chun, Sanghyuk ;
Akata, Zeynep ;
Shim, Hyunjung .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3130-3139
[4]   Attention-based Dropout Layer for Weakly Supervised Object Localization [J].
Choe, Junsuk ;
Shim, Hyunjung .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2214-2223
[5]   Weakly Supervised Cascaded Convolutional Networks [J].
Diba, Ali ;
Sharma, Vivek ;
Pazandeh, Ali ;
Pirsiavash, Hamed ;
Van Gool, Luc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5131-5139
[6]  
Ghiasi G, 2018, ADV NEUR IN, V31
[7]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[8]   Identity Mappings in Deep Residual Networks [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :630-645
[9]  
Hu J., 2018, CVPR
[10]   Weakly Supervised Object Boundaries [J].
Khoreva, Anna ;
Benenson, Rodrigo ;
Omran, Mohamed ;
Hein, Matthias ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :183-192