Uncertainty-Aware Graph-Guided Weakly Supervised Object Detection

被引:4
|
作者
Zhu, Yueyi [1 ]
Zhang, Yongqiang [1 ]
Ding, Mingli [1 ]
Zuo, Wangmeng [2 ]
机构
[1] Harbin Inst Technol HIT, Sch Instrument Sci & Engn, Harbin 150001, Peoples R China
[2] Harbin Inst Technol HIT, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
美国国家科学基金会;
关键词
Weakly supervised object detection; uncertainty-aware self-training; graph convolution network; LOCALIZATION; NETWORKS;
D O I
10.1109/TCSVT.2022.3232487
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly supervised object detection is an important and challenging task in the computer vision community. In this paper, we treat weakly supervised object detection as a self-training learning task. Based on the framework of self-training, weakly supervised object detection has two uncertainties during training, i.e., the uncertainty of the pseudo labels and the uncertainty of bounding box regression. To this end, we propose an uncertainty-aware graph-guided self-training framework to eliminate these uncertainties. First, we adopt a precise positive and negative sampling strategy to generate pseudo labels to solve the problem of pseudo label uncertainty. Then, we design a weighted location refinement branch based on Bayesian uncertainty modeling to overcome the bounding box regression uncertainty. Moreover, the imbalance between classification and localization tasks prevents the model from generating the task-aware feature map, and redundant proposals, if not handled properly, also introduce uncertainty to the detector. To overcome this problem, we design a graph-guided module that not only balances the two tasks from the perspective of features but also makes full use of proposals. Furthermore, the relation graph of proposals is constructed by clustering proposals, and then, the graph convolution network (GCN) is applied to propagate information on the graph. Thus, accurate feature representations of the objects are obtained through the graph-guided module, and the classification and localization tasks can promote each other. Extensive experiments on the PASCAL VOC 2007 and 2012 datasets demonstrate the effectiveness of our framework, and we obtain 55.2% and 52.0% mAPs on VOC2007 and VOC2012, respectively, showing its superiority over the state-of-the-art approaches by a large margin.
引用
收藏
页码:3257 / 3269
页数:13
相关论文
共 50 条
  • [41] Min-Entropy Latent Model for Weakly Supervised Object Detection
    Wan, Fang
    Wei, Pengxu
    Jiao, Jianbin
    Han, Zhenjun
    Ye, Qixiang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1297 - 1306
  • [42] Leveraging orientation for weakly supervised object detection with application to firearm localization
    Iqbal, Javed
    Munir, Muhammad Akhtar
    Mahmood, Arif
    Ali, Afsheen Rafaqat
    Ali, Mohsen
    NEUROCOMPUTING, 2021, 440 : 310 - 320
  • [43] Instance-Level Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Ming
    Zeng, Bing
    SENSORS, 2022, 22 (19)
  • [44] Weakly Supervised Attended Object Detection Using Gaze Data as Annotations
    Mazzamuto, Michele
    Ragusa, Francesco
    Furnari, Antonino
    Signorello, Giovanni
    Farinella, Giovanni Maria
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 263 - 274
  • [45] Recurrent self-optimizing proposals for weakly supervised object detection
    Ming Zhang
    Bing Zeng
    Neural Computing and Applications, 2023, 35 : 757 - 771
  • [46] A Dual-Network Progressive Approach to Weakly Supervised Object Detection
    Dong, Xuanyi
    Meng, Deyu
    Ma, Fan
    Yang, Yi
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 279 - 287
  • [47] Recurrent self-optimizing proposals for weakly supervised object detection
    Zhang, Ming
    Zeng, Bing
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01) : 757 - 771
  • [48] HIERARCHICAL REGION PROPOSAL REFINEMENT NETWORK FOR WEAKLY SUPERVISED OBJECT DETECTION
    Zhang, Ming
    Liu, Shuaicheng
    Zeng, Bing
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 669 - 673
  • [49] A progressive learning framework based on single-instance annotation for weakly supervised object detection
    Zhang, Ming
    Zeng, Bing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 193
  • [50] Towards automatic visual inspection: A weakly supervised learning method for industrial applicable object detection
    Ge, Ce
    Wang, Jing
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Liao, Jianxin
    COMPUTERS IN INDUSTRY, 2020, 121