Object Detection in UAV Images via Global Density Fused Convolutional Network

被引：68

作者：

Zhang, Ruiqian ^{[1
]}

Shao, Zhenfeng ^{[2
]}

Huang, Xiao ^{[3
]}

Wang, Jiaming ^{[2
]}

Li, Deren ^{[2
]}

机构：

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China

[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[3] Univ Arkansas, Dept Geosci, Fayetteville, AR 72701 USA

来源：

REMOTE SENSING | 2020年 / 12卷 / 19期

基金：

中国国家自然科学基金;

关键词：

object detection; UAV images; global density model; global density fused convolutional network;

D O I：

10.3390/rs12193140

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Object detection in Unmanned Aerial Vehicle (UAV) images plays fundamental roles in a wide variety of applications. As UAVs are maneuverable with high speed, multiple viewpoints, and varying altitudes, objects in UAV images are distributed with great heterogeneity, varying in size, with high density, bringing great difficulty to object detection using existing algorithms. To address the above issues, we propose a novel global density fused convolutional network (GDF-Net) optimized for object detection in UAV images. We test the effectiveness and robustness of the proposed GDF-Nets on the VisDrone dataset and the UAVDT dataset. The designed GDF-Net consists of a Backbone Network, a Global Density Model (GDM), and an Object Detection Network. Specifically, GDM refines density features via the application of dilated convolutional networks, aiming to deliver larger reception fields and to generate global density fused features. Compared with base networks, the addition of GDM improves the model performance in both recall and precision. We also find that the designed GDM facilitates the detection of objects in congested scenes with high distribution density. The presented GDF-Net framework can be instantiated to not only the base networks selected in this study but also other popular object detection models.

引用

页码：1 / 17

页数：17

共 70 条

[41] See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks [J].

Lu, Xiankai ;

Wang, Wenguan ;

Ma, Chao ;

Shen, Jianbing ;

Shao, Ling ;

Porikli, Fatih .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3618-3627

[42] Real-Time Detection of Ground Objects Based on Unmanned Aerial Vehicle Remote Sensing with Deep Learning: Application in Excavator Detection for Pipeline Safety [J].

Meng, Lingxuan ;

Peng, Zhixing ;

Zhou, Ji ;

Zhang, Jirong ;

Lu, Zhenyu ;

Baumann, Andreas ;

Du, Yan .

REMOTE SENSING, 2020, 12 (01)

[43] G-CNN: an Iterative Grid Based Object Detector [J].

Najibi, Mahyar ;

Rastegari, Mohammad ;

Davis, Larry S. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2369-2377

[44] Libra R-CNN: Towards Balanced Learning for Object Detection [J].

Pang, Jiangmiao ;

Chen, Kai ;

Shi, Jianping ;

Feng, Huajun ;

Ouyang, Wanli ;

Lin, Dahua .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :821-830

[45] A Novel Object-Based Deep Learning Framework for Semantic Segmentation of Very High-Resolution Remote Sensing Data: Comparison with Convolutional and Fully Convolutional Networks [J].

Papadomanolaki, Maria ;

Vakalopoulou, Maria ;

Karantzalos, Konstantinos .

REMOTE SENSING, 2019, 11 (06)

[46]

Portmann J, 2014, IEEE INT CONF ROBOT, P1794, DOI 10.1109/ICRA.2014.6907094

[47] Object Detection in Remote Sensing Images Based on Improved Bounding Box Regression and Multi-Level Features Fusion [J].

Qian, Xiaoliang ;

Lin, Sheng ;

Cheng, Gong ;

Yao, Xiwen ;

Ren, Hangli ;

Wang, Wei .

REMOTE SENSING, 2020, 12 (01)

[48]

Redmon J, 2018, Arxiv, DOI [arXiv:1804.02767, DOI 10.48550/ARXIV.1804.02767]

[49] YOLO9000: Better, Faster, Stronger [J].

Redmon, Joseph ;

Farhadi, Ali .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6517-6525

[50] You Only Look Once: Unified, Real-Time Object Detection [J].

Redmon, Joseph ;

Divvala, Santosh ;

Girshick, Ross ;

Farhadi, Ali .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788

← 1 2 3 4 5 6 7 →