High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network

被引:50
|
作者
Chen, Xiaoyu [1 ]
Li, Hongliang [1 ]
Wu, Qingbo [1 ]
Ngan, King Ngi [1 ]
Xu, Linfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural networks (CNNs); deep learning; object detection; object recognition;
D O I
10.1109/TCSVT.2020.2987465
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object proposals are used in two-stage detectors, such as R-CNN, to generate detection results, including category predictions and refined bounding-boxes. As a result, classification scores are assigned to refined bounding-boxes rather than object proposals. However, this procedure ignores the discrepancy of data distribution between object proposals and refined bounding-boxes. We consider this discrepancy could limit the detection accuracy. Specifically, the foreground/background imbalance on object proposals and inaccurate information from low-IoU proposals could hinder the category prediction. In this paper, we propose a detector called the Multi-Path Detection Calibration Network (PDC-Net) to address this problem. The key idea behind PDC-Net is calibrating detection results from R-CNN by considering the statistical discrepancy between object proposals and refined bounding-boxes. PDC-Net is built on Faster R-CNN. The core component in PDC-Net is the multi-path detection head, in which the base detector (from Faster R-CNN) generates detection results from object proposals and multiple calibration detectors fix incorrect outputs from the base detector using refined bounding-boxes. Experiments reveal that PDC-Net can boost detection results. Our method could reach 83.1% and 43.3% mAP respectively on PASCAL VOC and MSCOCO benchmarks, which is comparable to several state-of-the-art methods.
引用
收藏
页码:715 / 727
页数:13
相关论文
共 50 条
  • [41] An accurate object detection of wood defects using an improved Faster R-CNN model
    Zou, Xianghe
    Wu, Chongyang
    Liu, Hongen
    Yu, Zhangwei
    Kuang, Xianyan
    WOOD MATERIAL SCIENCE & ENGINEERING, 2024,
  • [42] A newly proposed object detection method using Faster R-CNN Inception with ResNet based on Tensorflow
    Saha, Sujay
    Khabir, Kanij Mehtanin
    Abir, Shadman Saquib
    Islam, Ariful
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2019, 2019, 10996
  • [43] Urban Traffic Object Detection Based on Multi-Stage Proposal Sparse R-CNN
    Liu C.-Y.
    Zhang Y.-L.
    Bi X.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (01): : 26 - 31
  • [44] Multi-Path Dilated Residual Network for Nuclei Segmentation and Detection
    Wang, Eric Ke
    Zhang, Xun
    Pan, Leyun
    Cheng, Caixia
    Dimitrakopoulou-Strauss, Antonia
    Ni, Yueping
    Zhe, Nie
    CELLS, 2019, 8 (05)
  • [45] Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
    Liu, Tianrui
    Stathaki, Tania
    FRONTIERS IN NEUROROBOTICS, 2018, 12
  • [46] Sparse R-CNN: An End-to-End Framework for Object Detection
    Sun, Peize
    Zhang, Rufeng
    Jiang, Yi
    Kong, Tao
    Xu, Chenfeng
    Zhan, Wei
    Tomizuka, Masayoshi
    Yuan, Zehuan
    Luo, Ping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15650 - 15664
  • [47] DECONV R-CNN FOR SMALL OBJECT DETECTION ON REMOTE SENSING IMAGES
    Zhang, Wei
    Wang, Shihao
    Thachan, Sophanyouly
    Chen, Jingzhou
    Qian, Yuntao
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2483 - 2486
  • [48] Multi-wavelength Solar Event Detection using Faster R-CNN
    Kucuk, Ahmet
    Aydin, Berkay
    Angryk, Rafal
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2552 - 2558
  • [49] Distributed Edge Cloud R-CNN for Real Time Object Detection
    Herrera, Joshua
    Demir, Mevlut A.
    Yousefi, Parsa
    Prevost, John J.
    Rad, Paul
    2018 WORLD AUTOMATION CONGRESS (WAC), 2018, : 146 - 151
  • [50] Foreign Object Detection of Transmission Lines Based on Faster R-CNN
    Guo, Shuqiang
    Bai, Qianlong
    Zhou, Xinxin
    INFORMATION SCIENCE AND APPLICATIONS, 2020, 621 : 269 - 275