HYPER FEATURE FUSION PYRAMID NETWORK FOR OBJECT DETECTION

被引:0
作者
Huang, Shouzhi [1 ]
Li, Xiaoyu [2 ]
Jiang, Zhuqing [1 ,3 ]
Guo, Xiaoqiang [2 ]
Men, Aidong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Acad Broadcasting Sci, Beijing, Peoples R China
[3] Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China
来源
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018) | 2018年
基金
美国国家科学基金会;
关键词
Convolutional Neural Network; feature fusion; feature pyramids; object detection;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present Hyper Feature Fusion Pyramid Network (HFFP-net), an efficient framework for object detection which firstly fuses features from multiple layers and then builds a new branch to construct feature pyramids based on the fused feature. HFFPnet reuses the hierarchical features from multiple layers of convolutional neural networks to build feature pyramids to construct semantic features at all levels so that the multi-scale features of different layers are highly enriched. We also add another branch in the detection head to predict objectness to reduce easy negative candidates. We propose several different feature fusion models and we have done several experiments to show the advantages of the proposed approach. Our network runs at the speed of 20 FPS (frame per second) which is faster than Faster R-CNN counterpart and our method achieves competitive detection performance.
引用
收藏
页数:6
相关论文
共 27 条
[1]  
[Anonymous], 2017, ARXIV170306870
[2]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[3]  
Dai J., 2016, ADV NEURAL INFORM PR, V29, P379, DOI [DOI 10.1016/J.JPOWSOUR.2007.02.075, DOI 10.48550/ARXIV.1605.06409, DOI 10.1109/CVPR.2017.690]
[4]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[5]  
Fu Cheng- Yang, 2013, ARXIV170106659
[6]   Object detection via a multi-region & semantic segmentation-aware CNN model [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1134-1142
[7]  
Girshick R., 2014, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81, 10.1109/cvpr.2014.81]
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]