Feature Enhancement for Multi-scale Object Detection

Cited by: 1
Authors
Huicheng Zheng
Jiajie Chen
Lvran Chen
Ye Li
Zhiwei Yan
Affiliations
[1] Sun Yat-sen University, School of Data and Computer Science
[2] Ministry of Education, Key Laboratory of Machine Intelligence and Advanced Computing
[3] China Southern Power Grid, Digital Grid Research Institute
Source
Neural Processing Letters | 2020 / Vol. 51
Keywords
Object detection; Deep learning; Oriented gradient features; Dilated convolution
DOI
Not available
Abstract
Recently, deep learning has brought great progress in object detection. However, we believe that traditional hand-crafted features may still contain valuable human knowledge complementary to features learned from raw data. Besides, almost all top-performing object detection methods extract features using backbones originally designed for image classification. The generated features are often highly semantic, which is beneficial to global image classification, but may lose details useful for object localization and recognition at various scales. To alleviate these problems, this paper proposes a feature enhancement method. Inspired by the success of histograms of oriented gradients in traditional object detection research, we construct feature channels based on oriented gradients as input to convolutional neural networks to capture discriminative local orientations. The oriented-gradient and RGB channels are stacked as the network input to enhance the input feature representation. For accurate object localization and recognition, we employ dilated convolutions to increase the spatial resolution of the output feature maps while maintaining their respective receptive fields. Hierarchical feature maps with different receptive fields are aggregated into the final feature representation for multi-scale object detection without extra upsampling. Experimental results on PASCAL VOC 2007 and 2012 demonstrate the superiority of the proposed method compared with state-of-the-art methods for multi-scale object detection.
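The abstract's claim that dilated convolutions raise output resolution while preserving the receptive field can be checked with simple arithmetic. The sketch below is only an illustration: the function name `receptive_field` and the two three-layer stacks are hypothetical and are not taken from the paper's architecture. It uses the standard relations that a kernel of size k with dilation d covers k + (k - 1)(d - 1) input positions, and that each layer widens the receptive field by (effective kernel - 1) times the accumulated stride.

```python
def receptive_field(layers):
    """Return (receptive_field, total_stride) for a stack of conv layers.

    Each layer is a (kernel_size, stride, dilation) tuple.
    """
    rf, jump = 1, 1  # receptive field; input-pixel spacing between adjacent outputs
    for kernel, stride, dilation in layers:
        # A dilated kernel spans more input positions than its nominal size.
        effective_kernel = kernel + (kernel - 1) * (dilation - 1)
        rf += (effective_kernel - 1) * jump
        jump *= stride
    return rf, jump


# Hypothetical three-layer stacks (assumed for illustration, not the paper's backbone).
strided = [(3, 1, 1), (3, 2, 1), (3, 1, 1)]  # second layer downsamples by 2
dilated = [(3, 1, 1), (3, 1, 1), (3, 1, 2)]  # stride removed, last layer dilated by 2

print(receptive_field(strided))  # (9, 2)
print(receptive_field(dilated))  # (9, 1)
```

Under these assumptions both stacks see a 9x9 input window, but the dilated stack keeps a total stride of 1, so its output feature map has twice the resolution along each axis.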
Pages: 1907-1919
Number of pages: 12