An Enhanced Feature Pyramid Object Detection Network for Autonomous Driving

被引：16

作者：

Wu, Yutian ^{[1
]}

Tang, Shuming ^{[2
]}

Zhang, Shuwei ^{[1
]}

Ogai, Harutoshi ^{[1
]}

机构：

[1] Waseda Univ, Grad Sch Informat Prod & Syst, Fukuoka, Fukuoka 8080135, Japan

[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 20期

关键词：

object detection; feature pyramid network; feature recalibration; context embedding; autonomous driving systems; augmented reality;

D O I：

10.3390/app9204363

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Feature Pyramid Network (FPN) builds a high-level semantic feature pyramid and detects objects of different scales in corresponding pyramid levels. Usually, features within the same pyramid levels have the same weight for subsequent object detection, which ignores the feature requirements of different scale objects. As we know, for most detection networks, it is hard to detect small objects and occluded objects because there is little information to exploit. To solve the above problems, we propose an Enhanced Feature Pyramid Object Detection Network (EFPN), which innovatively constructs an enhanced feature extraction subnet and adaptive parallel detection subnet. Enhanced feature extraction subnet introduces Feature Weight Module (FWM) to enhance pyramid features by weighting the fusion feature map. Adaptive parallel detection subnet introduces Adaptive Context Expansion (ACE) and Parallel Detection Branch (PDB). ACE aims to generate the features of adaptively enlarged object context region and original region. PDB predicts classification and regression results separately with the two features. Experiments showed that EFPN outperforms FPN in detection accuracy on Pascal VOC and KITTI datasets. Furthermore, the performance of EFPN meets the real-time requirements of autonomous driving systems.

引用

页数：15

共 27 条

[1] [Anonymous], P 3 INT C LEARNING R
[2] [Anonymous], 2017, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2017.322
[3] [Anonymous], 2012, P IEEE C COMP VIS PA
[4] [Anonymous], P C NEUR INF PROC SY
[5] [Anonymous], P BRIT MACH VIS C OX
[6] [Anonymous], 2016, P IEEE C COMP VIS PA
[7] [Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[8] [Anonymous], 2016, ARXIV161206851
[9] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Bell, Sean
Zitnick, C. Lawrence
Bala, Kavita
Girshick, Ross
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
[10] A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
Cai, Zhaowei
Fan, Quanfu
Feris, Rogerio S.
Vasconcelos, Nuno
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 354 - 370

← 1 2 3 →