Weakly Supervised Region Proposal Network and Object Detection

被引:114
|
作者
Tang, Peng [1 ]
Wang, Xinggang [1 ]
Wang, Angtian [1 ]
Yan, Yongluan [1 ]
Liu, Wenyu [1 ]
Huang, Junzhou [2 ,3 ]
Yuille, Alan [4 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[2] Tencent AI Iab, Shenzhen, Peoples R China
[3] Univ Texas Arlington, Dept CSE, Arlington, TX 76019 USA
[4] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
来源
COMPUTER VISION - ECCV 2018, PT XI | 2018年 / 11215卷
关键词
Object detection; Region proposal; Weakly supervised learning; Convolutional neural network; LOCALIZATION;
D O I
10.1007/978-3-030-01252-6_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Convolutional Neural Network (CNN) based region proposal generation method (i.e. region proposal network), trained using bounding box annotations, is an essential component in modern fully supervised object detectors. However, Weakly Supervised Object Detection (WSOD) has not benefited from CNN-based proposal generation due to the absence of bounding box annotations, and is relying on standard proposal generation methods such as selective search. In this paper, we propose a weakly supervised region proposal network which is trained using only image-level annotations. The weakly supervised region proposal network consists of two stages. The first stage evaluates the objectness scores of sliding window boxes by exploiting the low-level information in CNN and the second stage refines the proposals from the first stage using a region-based CNN classifier. Our proposed region proposal network is suitable for WSOD, can be plugged into a WSOD network easily, and can share its convolutional computations with the WSOD network. Experiments on the PASCAL VOC and ImageNet detection datasets show that our method achieves the state-of-the-art performance for WSOD with performance gain of about 3% on average.
引用
收藏
页码:370 / 386
页数:17
相关论文
共 50 条
  • [1] PCL: Proposal Cluster Learning for Weakly Supervised Object Detection
    Tang, Peng
    Wang, Xinggang
    Bai, Song
    Shen, Wei
    Bai, Xiang
    Liu, Wenyu
    Yuille, Alan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (01) : 176 - 191
  • [2] Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection
    Lv, Pei
    Hu, Suqi
    Hao, Tianran
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6879 - 6892
  • [3] Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection
    Feng, Xiaoxu
    Yao, Xiwen
    Shen, Hui
    Cheng, Gong
    Xiao, Bin
    Han, Junwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11977 - 11992
  • [4] Dynamic proposal sampling for weakly supervised object detection
    Jiang, Wenhui
    Zhao, Zhicheng
    Su, Fei
    Fang, Yuming
    NEUROCOMPUTING, 2021, 441 : 248 - 259
  • [5] Weakly Supervised Group Mask Network for Object Detection
    Song, Lingyun
    Liu, Jun
    Sun, Mingxuan
    Shang, Xuequn
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (03) : 681 - 702
  • [6] Online Active Proposal Set Generation for weakly supervised object detection
    Jin, Ruibing
    Lin, Guosheng
    Wen, Changyun
    KNOWLEDGE-BASED SYSTEMS, 2022, 237
  • [7] Motion Context Network for Weakly Supervised Object Detection in Videos
    Jin, Ruibing
    Lin, Guosheng
    Wen, Changyun
    Wang, Jianliang
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1864 - 1868
  • [8] Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection
    Han, Mingfei
    Wang, Yali
    Li, Mingjie
    Chang, Xiaojun
    Yang, Yi
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1560 - 1573
  • [9] Weakly Supervised Object Detection With Class Prototypical Network
    Li, Huifang
    Li, Yidong
    Cao, Yuanzhouhan
    Han, Yushan
    Jin, Yi
    Wei, Yunchao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1868 - 1878
  • [10] Weakly Supervised Group Mask Network for Object Detection
    Lingyun Song
    Jun Liu
    Mingxuan Sun
    Xuequn Shang
    International Journal of Computer Vision, 2021, 129 : 681 - 702