An Enhanced Region Proposal Network for object detection using deep learning method

Cited by: 20
Authors
Chen, Yu Peng [1, 2]
Li, Ying [1, 2]
Wang, Gang [1, 2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
GRADIENTS;
DOI
10.1371/journal.pone.0203897
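Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09;
Abstract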
Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, its detection performance is limited by the Region Proposal Network (RPN) on which it relies. Inspired by the RPN of Faster R-CNN, we propose a novel proposal generation method called the Enhanced Region Proposal Network (ERPN). ERPN introduces four improvements. First, our proposed deconvolutional feature pyramid network (DFPN) is introduced to improve the quality of region proposals. Second, novel anchor boxes are designed with interspersed scales and adaptive aspect ratios, which increases the capability of object localization. Third, a particle swarm optimization (PSO) based support vector machine (SVM), termed PSO-SVM, is developed to distinguish positive from negative anchor boxes. Fourth, the classification part of the multitask loss function in RPN is improved, which strengthens the effect of the classification loss. ERPN is compared with five object detection methods on both the PASCAL VOC and COCO data sets. With the VGG-16 model, ERPN obtains 78.6% mAP on the VOC 2007 data set, 74.4% mAP on the VOC 2012 data set, and 31.7% on the COCO data set, the best performance among the compared methods. Furthermore, ERPN runs at a detection speed of 5.8 fps and performs well on small object detection.
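
For readers unfamiliar with anchor boxes, the sketch below shows how the baseline RPN in Faster R-CNN typically tiles anchors over a feature map: three scales times three aspect ratios at every position. It is only an illustration of that baseline under stated assumptions. The abstract does not specify ERPN's interspersed scales or adaptive aspect ratios, so they are not reproduced here, and the function names `make_anchors` and `shift_anchors` are hypothetical.

```python
import math


def make_anchors(base_size=16, scales=(8, 16, 32), ratios=(0.5, 1.0, 2.0)):
    """Return origin-centred anchors as (x1, y1, x2, y2) tuples.

    Assumption: ratio means height / width, and every anchor keeps the
    area (base_size * scale) ** 2, as in the common baseline RPN setup.
    """
    anchors = []
    for ratio in ratios:
        for scale in scales:
            side = base_size * scale
            w = side / math.sqrt(ratio)  # wider when ratio < 1
            h = side * math.sqrt(ratio)  # taller when ratio > 1
            anchors.append((-w / 2, -h / 2, w / 2, h / 2))
    return anchors


def shift_anchors(anchors, feat_h, feat_w, stride=16):
    """Copy the base anchors to the centre of every feature-map cell."""
    shifted = []
    for row in range(feat_h):
        for col in range(feat_w):
            cx = col * stride + stride / 2
            cy = row * stride + stride / 2
            for x1, y1, x2, y2 in anchors:
                shifted.append((x1 + cx, y1 + cy, x2 + cx, y2 + cy))
    return shifted


if __name__ == "__main__":
    base = make_anchors()
    tiled = shift_anchors(base, feat_h=2, feat_w=3)
    print(len(base), len(tiled))
```

An ERPN-style variant would presumably change the `scales` and `ratios` inputs (and possibly make them depend on the data), but the full paper, not this record, is the place to confirm how.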
Pages: 26