Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks

被引:168
作者
Deng, Zhipeng [1 ]
Sun, Hao [1 ]
Zhou, Shilin [1 ]
Zhao, Juanping [2 ]
Zou, Huanxin [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Engn, Changsha 410073, Hunan, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Attribute learning; convolutional neural networks; vehicle detection; vehicle proposal network; OBJECT DETECTION; FEATURES; CARS;
D O I
10.1109/JSTARS.2017.2694890
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vehicle detection in aerial images, being an interesting but challenging problem, plays an important role for a wide range of applications. Traditional methods are based on sliding-window search and handcrafted or shallow-learning-based features with heavy computational costs and limited representation power. Recently, deep learning algorithms, especially region-based convolutional neural networks (R-CNNs), have achieved state-of-the-art detection performance in computer vision. However, several challenges limit the applications of R-CNNs in vehicle detection from aerial images: 1) vehicles in large-scale aerial images are relatively small in size, and R-CNNs have poor localization performance with small objects; 2) R-CNNs are particularly designed for detecting the bounding box of the targets without extracting attributes; 3) manual annotation is generally expensive and the available manual annotation of vehicles for training R-CNNs are not sufficient in number. To address these problems, this paper proposes a fast and accurate vehicle detection framework. On one hand, to accurately extract vehicle-like targets, we developed an accurate-vehicle-proposal-network (AVPN) based on hyper feature map which combines hierarchical feature maps that are more accurate for small object detection. On the other hand, we propose a coupled R-CNN method, which combines an AVPN and a vehicle attribute learning network to extract the vehicle's location and attributes simultaneously. For original large-scale aerial images with limited manual annotations, we use cropped image blocks for training with data augmentation to avoid overfitting. Comprehensive evaluations on the public Munich vehicle dataset and the collected vehicle dataset demonstrate the accuracy and effectiveness of the proposed method.
引用
收藏
页码:3652 / 3664
页数:13
相关论文
共 52 条
[21]   A New Building Extraction Postprocessing Framework for High-Spatial-Resolution Remote-Sensing Imagery [J].
Huang, Xin ;
Yuan, Wenliang ;
Li, Jiayi ;
Zhang, Liangpei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (02) :654-668
[22]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[23]   Vehicle Detection Using Partial Least Squares [J].
Kembhavi, Aniruddha ;
Harwood, David ;
Davis, Larry S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (06) :1250-1265
[24]  
Kim K, 2016, INT S HIGH PERF COMP, P163, DOI 10.1109/HPCA.2016.7446062
[25]  
Kluckner S, 2007, IEEE I CONF COMP VIS, P47
[26]   HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection [J].
Kong, Tao ;
Yao, Anbang ;
Chen, Yurong ;
Sun, Fuchun .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :845-853
[27]   Building Detection Using Enhanced HOG-LBP Features and Region Refinement Processes [J].
Konstantinidis, Dimitrios ;
Stathaki, Tania ;
Argyriou, Vasileios ;
Grammalidis, Nikolaos .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (03) :888-905
[28]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[29]   Backpropagation Applied to Handwritten Zip Code Recognition [J].
LeCun, Y. ;
Boser, B. ;
Denker, J. S. ;
Henderson, D. ;
Howard, R. E. ;
Hubbard, W. ;
Jackel, L. D. .
NEURAL COMPUTATION, 1989, 1 (04) :541-551
[30]   An Operational System for Estimating Road Traffic Information from Aerial Images [J].
Leitloff, Jens ;
Rosenbaum, Dominik ;
Kurz, Franz ;
Meynberg, Oliver ;
Reinartz, Peter .
REMOTE SENSING, 2014, 6 (11) :11315-11341