Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships

被引:77
作者
Zhang, Dingwen [1 ]
Zeng, Wenyuan [1 ]
Yao, Jieru [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Brain & Artificial Intelligence Lab, Xian 710072, Peoples R China
基金
美国国家科学基金会;
关键词
Cognition; Proposals; Object detection; Supervised learning; Semantics; Task analysis; Network architecture; Weakly supervised object detection; multiple-instance learning; graphical convolutional network;
D O I
10.1109/TPAMI.2020.3046647
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, weakly supervised object detection has attracted great attention in the computer vision community. Although numerous deep learning-based approaches have been proposed in the past few years, such an ill-posed problem is still challenging and the learning performance is still behind the expectation. In fact, most of the existing approaches only consider the visual appearance of each proposal region but ignore to make use of the helpful context information. To this end, this paper introduces two levels of context into the weakly supervised learning framework. The first one is the proposal-level context, i.e., the relationship of the spatially adjacent proposals. The second one is the semantic-level context, i.e., the relationship of the co-occurring object categories. Therefore, the proposed weakly supervised learning framework contains not only the cognition process on the visual appearance but also the reasoning process on the proposal- and semantic-level relationships, which leads to the novel deep multiple instance reasoning framework. Specifically, built upon a conventional CNN-based network architecture, the proposed framework is equipped with two additional graph convolutional network-based reasoning models to implement object location reasoning and multi-label reasoning within an end-to-end network training procedure. Comprehensive experiments on the widely used PASCAL VOC and MS COCO benchmarks have been implemented, which demonstrate the superior capacity of the proposed approach when compared with other state-of-the-art methods and baseline models.
引用
收藏
页码:3349 / 3363
页数:15
相关论文
共 58 条
  • [1] Dissimilarity Coefficient based Weakly Supervised Object Detection
    Arun, Aditya
    Jawahar, C., V
    Kumar, M. Pawan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9424 - 9433
  • [2] Weakly Supervised Deep Detection Networks
    Bilen, Hakan
    Vedaldi, Andrea
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2846 - 2854
  • [3] Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition
    Chen, Tianshui
    Xu, Muxin
    Hui, Xiaolu
    Wu, Hefeng
    Lin, Liang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 522 - 531
  • [4] Learning Implicit Fields for Generative Shape Modeling
    Chen, Zhiqin
    Zhang, Hao
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5932 - 5941
  • [5] Multi-fold MIL Training for Weakly Supervised Object Localization
    Cinbis, Ramazan Gokberk
    Verbeek, Jakob
    Schmid, Cordelia
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2409 - 2416
  • [6] Weakly Supervised Localization and Learning with Generic Knowledge
    Deselaers, Thomas
    Alexe, Bogdan
    Ferrari, Vittorio
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (03) : 275 - 293
  • [7] Weakly Supervised Cascaded Convolutional Networks
    Diba, Ali
    Sharma, Vivek
    Pazandeh, Ali
    Pirsiavash, Hamed
    Van Gool, Luc
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5131 - 5139
  • [8] Learning a Deep ConvNet for Multi-label Classification with Partial Labels
    Durand, Thibaut
    Mehrasa, Nazanin
    Mori, Greg
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 647 - 657
  • [9] Graph Convolutional Tracking
    Gao, Junyu
    Zhang, Tianzhu
    Xu, Changsheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4644 - 4654
  • [10] C-WSL: Count-Guided Weakly Supervised Localization
    Gao, Mingfei
    Li, Ang
    Yu, Ruichi
    Morariu, Vlad, I
    Davis, Larry S.
    [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 155 - 171