Orientation-Aware Vehicle Detection in Aerial Images via an Anchor-Free Object Detection Approach

被引:53
作者
Shi, Furong [1 ]
Zhang, Tong [1 ]
Zhang, Tao [2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Tianjin Inst Geotech Invest Surveying, Tianjin 300191, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021年 / 59卷 / 06期
基金
中国国家自然科学基金;
关键词
Vehicle detection; Feature extraction; Detectors; Object detection; Task analysis; Proposals; Remote sensing; Anchor-free; convolutional neural network (CNN); multitask learning; orientation prediction; vehicle detection;
D O I
10.1109/TGRS.2020.3011418
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Vehicle detection in aerial images is an important and challenging task in the field of remote sensing. Recently, deep learning technologies have yielded superior performance for object detection in remote sensing images. However, the detection results of the existing methods are horizontal bounding boxes that ignore vehicle orientations, thereby having limited applicability in scenes with dense vehicles or clutter backgrounds. In this article, we propose a one-stage, anchor-free detection approach to detect arbitrarily oriented vehicles in high-resolution aerial images. The vehicle detection task is transformed into a multitask learning problem by directly predicting high-level vehicle features via a fully convolutional network. That is, a classification subtask is created to look for vehicle central points and three regression subtasks are created to predict vehicle orientations, scales, and offsets of vehicle central points. First, coarse and fine feature maps outputted from different stages of a residual network are concatenated together by a feature pyramid fusion strategy. Upon the concatenated features, four convolutional layers are attached in parallel to predict high-level vehicle features. During training, task uncertainty learned from the training data is used to weight loss function in the multitask learning setting. For inferencing, oriented bounding boxes are generated using the predicted vehicle features, and oriented nonmaximum suppression (NMS) postprocessing is used to reduce redundant results. Experiments on two public aerial image data sets have shown the effectiveness of the proposed approach.
引用
收藏
页码:5221 / 5233
页数:13
相关论文
共 54 条
[1]   Deep Learning Approach for Car Detection in UAV Imagery [J].
Ammour, Nassim ;
Alhichri, Haikel ;
Bazi, Yakoub ;
Benjdira, Bilel ;
Alajlan, Naif ;
Zuair, Mansour .
REMOTE SENSING, 2017, 9 (04)
[2]  
[Anonymous], 2011, P IEEE 2011 INT C EL
[3]   Vehicle Detection and Motion Analysis in Low-Altitude Airborne Video Under Urban Environment [J].
Cao, Xianbin ;
Wu, Changxia ;
Lan, Jinhe ;
Yan, Pingkun ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (10) :1522-1533
[4]   Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks [J].
Chen, Xueyun ;
Xiang, Shiming ;
Liu, Cheng-Lin ;
Pan, Chun-Hong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (10) :1797-1801
[5]   Vehicle Detection in High-Resolution Aerial Images via Sparse Representation and Superpixels [J].
Chen, Ziyi ;
Wang, Cheng ;
Wen, Chenglu ;
Teng, Xiuhua ;
Chen, Yiping ;
Guan, Haiyan ;
Luo, Huan ;
Cao, Liujuan ;
Li, Jonathan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (01) :103-116
[6]   Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks [J].
Deng, Zhipeng ;
Sun, Hao ;
Zhou, Shilin ;
Zhao, Juanping ;
Zou, Huanxin .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (08) :3652-3664
[7]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545
[8]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587