An empirical study of multi-scale object detection in high resolution UAV images

被引:54
作者
Zhang, Haijun [1 ]
Sun, Mingshan [1 ]
Li, Qun [1 ]
Liu, Linlin [1 ]
Liu, Ming [2 ]
Ji, Yuzhu [1 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci, Shenzhen 518055, Peoples R China
[2] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
UAV image; Object detection; High resolution; Multi-scale; Data set; VEHICLE DETECTION;
D O I
10.1016/j.neucom.2020.08.074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection in images collected by Unmanned Aerial Vehicles (UAVs) constitutes a challenging task in computer vision, due to difficulties of learning a well-trained object detection model for handling instances in UAV images with arbitrary orientations, variation in different scales, irregular shapes, etc. In order to facilitate object detection research and extend its applications in natural scenarios by using UAVs, this paper presents a large-scale benchmark dataset, MOHR, aiming at performing multi-scale object detection in UAV images with high resolution. A total of 90,014 object instances with labels and bounding boxes were annotated. In order to build a baseline for object detection on the MOHR dataset, we performed an empirical study by evaluating six state-of-the-art deep learning-based object detection models trained on our proposed dataset. Experimental results show promising detection performance, but also demonstrate that the dataset is quite challenging for adopting natural image-based object detection models for UAV images. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 182
页数:10
相关论文
共 32 条
[21]   Ship Rotated Bounding Box Space for Ship Extraction From High-Resolution Optical Satellite Images With Complex Backgrounds [J].
Liu, Zikun ;
Wang, Hongzhen ;
Weng, Lubin ;
Yang, Yiping .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (08) :1074-1078
[22]  
Lyu Ye, 2018, ARXIV181010438
[23]  
Mohammed F, 2014, INT CONF UNMAN AIRCR, P267, DOI 10.1109/ICUAS.2014.6842265
[24]   A Benchmark and Simulator for UAV Tracking [J].
Mueller, Matthias ;
Smith, Neil ;
Ghanem, Bernard .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :445-461
[25]   Deep Learning Approaches for Detecting Objects from Images: A Review [J].
Pathak, Ajeet Ram ;
Pandey, Manjusha ;
Rautaray, Siddharth .
PROGRESS IN COMPUTING, ANALYTICS AND NETWORKING, ICCAN 2017, 2018, 710 :491-499
[26]   Vehicle detection in aerial imagery : A small target detection benchmark [J].
Razakarivony, Sebastien ;
Jurie, Frederic .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 34 :187-203
[27]  
Redmon J, 2017, P IEEE C COMPUTER VI, P7263
[28]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[29]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[30]  
Robicquet A., 2016, LECT NOTES COMPUT SC, P549, DOI DOI 10.1007/978-3-319-46484-8_33