An Enhanced Feature Pyramid Object Detection Network for Autonomous Driving

被引:16
作者
Wu, Yutian [1 ]
Tang, Shuming [2 ]
Zhang, Shuwei [1 ]
Ogai, Harutoshi [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Fukuoka, Fukuoka 8080135, Japan
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 20期
关键词
object detection; feature pyramid network; feature recalibration; context embedding; autonomous driving systems; augmented reality;
D O I
10.3390/app9204363
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Feature Pyramid Network (FPN) builds a high-level semantic feature pyramid and detects objects of different scales in corresponding pyramid levels. Usually, features within the same pyramid levels have the same weight for subsequent object detection, which ignores the feature requirements of different scale objects. As we know, for most detection networks, it is hard to detect small objects and occluded objects because there is little information to exploit. To solve the above problems, we propose an Enhanced Feature Pyramid Object Detection Network (EFPN), which innovatively constructs an enhanced feature extraction subnet and adaptive parallel detection subnet. Enhanced feature extraction subnet introduces Feature Weight Module (FWM) to enhance pyramid features by weighting the fusion feature map. Adaptive parallel detection subnet introduces Adaptive Context Expansion (ACE) and Parallel Detection Branch (PDB). ACE aims to generate the features of adaptively enlarged object context region and original region. PDB predicts classification and regression results separately with the two features. Experiments showed that EFPN outperforms FPN in detection accuracy on Pascal VOC and KITTI datasets. Furthermore, the performance of EFPN meets the real-time requirements of autonomous driving systems.
引用
收藏
页数:15
相关论文
共 27 条
  • [1] [Anonymous], P 3 INT C LEARNING R
  • [2] [Anonymous], 2017, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2017.322
  • [3] [Anonymous], 2012, P IEEE C COMP VIS PA
  • [4] [Anonymous], P C NEUR INF PROC SY
  • [5] [Anonymous], P BRIT MACH VIS C OX
  • [6] [Anonymous], 2016, P IEEE C COMP VIS PA
  • [7] [Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
  • [8] [Anonymous], 2016, ARXIV161206851
  • [9] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
    Bell, Sean
    Zitnick, C. Lawrence
    Bala, Kavita
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
  • [10] A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
    Cai, Zhaowei
    Fan, Quanfu
    Feris, Rogerio S.
    Vasconcelos, Nuno
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 354 - 370