Self-produced Guidance for Weakly-Supervised Object Localization

Cited by: 163

Authors:

Zhang, Xiaolin [1]

Wei, Yunchao [2]

Kang, Guoliang [1]

Yang, Yi [1]

Huang, Thomas [2]

Affiliations:

[1] Univ Technol Sydney, CAI, Ultimo, NSW, Australia

[2] Univ Illinois, Champaign, IL USA

Source:

COMPUTER VISION - ECCV 2018, PT XII | 2018 / Volume 11216

Keywords:

Object localization; Weakly Supervised Learning;

DOI:

10.1007/978-3-030-01258-8_37

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Weakly supervised methods usually generate localization results from attention maps produced by classification networks. However, the attention maps highlight only the most discriminative parts of the object, which are small and sparse. We propose to generate Self-produced Guidance (SPG) masks, which separate the foreground, i.e., the object of interest, from the background to provide the classification networks with spatial correlation information of pixels. A stagewise approach is proposed to incorporate highly confident object regions to learn the SPG masks. The highly confident regions within attention maps are utilized to progressively learn the SPG masks. The masks are then used as auxiliary pixel-level supervision to facilitate the training of classification networks. Extensive experiments on ILSVRC demonstrate that SPG is effective in producing high-quality object localization maps. In particular, the proposed SPG achieves a Top-1 localization error rate of 43.83% on the ILSVRC validation set, which is a new state-of-the-art error rate.
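The idea of taking high-confidence regions from an attention map as seeds for a mask can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no thresholds or label encoding, so the function name `seed_mask`, the thresholds `high` and `low`, and the label values below are all assumptions chosen for illustration.

```python
from typing import List

# Hypothetical label values; the abstract does not specify an encoding.
FOREGROUND, BACKGROUND, IGNORE = 1, 0, 255


def seed_mask(attention: List[List[float]],
              high: float = 0.7,
              low: float = 0.1) -> List[List[int]]:
    """Turn a 2-D attention map into a three-way seed mask.

    The map is min-max normalised to [0, 1]. Pixels above `high` become
    foreground seeds, pixels below `low` become background seeds, and
    the remaining pixels are marked IGNORE (no supervision).
    Both thresholds are assumed values, not taken from the paper.
    """
    flat = [v for row in attention for v in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on a constant map
    mask = []
    for row in attention:
        out = []
        for v in row:
            s = (v - lo) / span
            if s > high:
                out.append(FOREGROUND)
            elif s < low:
                out.append(BACKGROUND)
            else:
                out.append(IGNORE)
        mask.append(out)
    return mask
```

In a stagewise setup like the one the abstract describes, pixels left as `IGNORE` at one stage would be the candidates to be resolved at a later stage. How that progression is carried out is not stated in this record.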


Pages: 610-625

Page count: 16
