Self-produced Guidance for Weakly-Supervised Object Localization

被引:163
作者
Zhang, Xiaolin [1 ]
Wei, Yunchao [2 ]
Kang, Guoliang [1 ]
Yang, Yi [1 ]
Huang, Thomas [2 ]
机构
[1] Univ Technol Sydney, CAI, Ultimo, NSW, Australia
[2] Univ Illinois, Champaign, IL USA
来源
COMPUTER VISION - ECCV 2018, PT XII | 2018年 / 11216卷
关键词
Object localization; Weakly Supervised Learning;
D O I
10.1007/978-3-030-01258-8_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised methods usually generate localization results based on attention maps produced by classification networks. However, the attention maps exhibit the most discriminative parts of the object which are small and sparse. We propose to generate Self-produced Guidance (SPG) masks which separate the foreground i.e., the object of interest, from the background to provide the classification networks with spatial correlation information of pixels. A stagewise approach is proposed to incorporate high confident object regions to learn the SPG masks. The high confident regions within attention maps are utilized to progressively learn the SPG masks. The masks are then used as an auxiliary pixel-level supervision to facilitate the training of classification networks. Extensive experiments on ILSVRC demonstrate that SPG is effective in producing high-quality object localizations maps. Particularly, the proposed SPG achieves the Top-1 localization error rate of 43.83% on the ILSVRC validation set, which is a new state-of-the-art error rate.
引用
收藏
页码:610 / 625
页数:16
相关论文
共 42 条
  • [1] [Anonymous], 2014, INT C LEARN REPR
  • [2] [Anonymous], 2018, IEEE CVPR
  • [3] [Anonymous], 2017, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2017.120
  • [4] Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks
    Cao, Chunshui
    Liu, Xianming
    Yang, Yi
    Yu, Yinan
    Wang, Jiang
    Wang, Zilei
    Huang, Yongzhen
    Wang, Liang
    Huang, Chang
    Xu, Wei
    Ramanan, Deva
    Huang, Thomas S.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2956 - 2964
  • [5] Chen Y.-C., 2017, Biostatistics & Epidemiology, V1, P161, DOI [DOI 10.1080/24709360.2017.1396742, 10.1080/24709360.2017.1396742]
  • [6] Revisiting RCNN: On Awakening the Classification Power of Faster RCNN
    Cheng, Bowen
    Wei, Yunchao
    Shi, Honghui
    Feris, Rogerio
    Xiong, Jinjun
    Huang, Thomas
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 473 - 490
  • [7] Weakly Supervised Cascaded Convolutional Networks
    Diba, Ali
    Sharma, Vivek
    Pazandeh, Ali
    Pirsiavash, Hamed
    Van Gool, Luc
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5131 - 5139
  • [8] A Dual-Network Progressive Approach to Weakly Supervised Object Detection
    Dong, Xuanyi
    Meng, Deyu
    Ma, Fan
    Yang, Yi
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 279 - 287
  • [9] More is Less: A More Complicated Network with Less Inference Complexity
    Dong, Xuanyi
    Huang, Junshi
    Yang, Yi
    Yan, Shuicheng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1895 - 1903
  • [10] Deeply Supervised Salient Object Detection with Short Connections
    Hou, Qibin
    Cheng, Ming-Ming
    Hu, Xiaowei
    Borji, Ali
    Tu, Zhuowen
    Torr, Philip
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5300 - 5309