Real-time Single Object Detection on The UAV

被引：0

作者：

Wu, Hsiang-Huang ^{[1
]}

Zhou, Zejian ^{[2
]}

Feng, Ming ^{[2
]}

Yan, Yuzhong ^{[1
]}

Xu, Hao ^{[2
]}

Qian, Lijun ^{[1
]}

机构：

[1] Texas A&M Univ Syst, Prairie View A&M Univ, Ctr Excellence Res & Educ Big Mil Data Intelligen, Prairie View, TX 77446 USA

[2] Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USA

来源：

2019 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS' 19) | 2019年

关键词：

Computer Vision; Convolutional Neural Network (CNN); Embedded System; UAV; Multi-task Learning;

D O I：

10.1109/icuas.2019.8797866

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The demand for mission critical tasks, especially for tracking on the UAVs, has been increasing due to their superior mobility. Out of necessity, the ability of processing large images emerges for object detection or tracking with UAVs. As such, the requirements of low latency and lack of Internet access under some circumstances become the major challenges. In this paper, we present a modeling method of CNN that is dedicated to single object detection on the UAV without any transfer learning model. Not limited to the features learned by the transfer learning model, the single object can be selected arbitrarily and specifically, even can be distinguished from those other objects in the same category. Our modeling method introduces the inducing neural network that follows the traditional CNN and plays the role of guiding the training in a fast and efficient way with respect to the training convergence and the model capacity. Using the dataset released by DAC 2018, which contains 98 classes and 96,408 images taken by UAVs, we present how our modeling method develops the inducing neural network that integrates multi-task learning drawn from the state-of-the-art works to achieve about 50% of IoU (Intersection over Union of the ground-truth bounding boxes and predicted bounding boxes) and 20 FPS running on NVIDIA Jetson TX2. Experimental results demonstrated fast inference of an image in size of 720x1280 and the UAV navigated itself to track the target (car) using the inference result.

引用

页码：1013 / 1022

页数：10

共 23 条

[1]

[Anonymous], TINY SSD TINY SINGLE

[2]

[Anonymous], ACM J EMERG TECH COM

[3]

[Anonymous], 2014, P BRIT MACH VIS C

[4]

[Anonymous], 2017, ARXIV170404861

[5]

[Anonymous], 2018, BMVC

[6]

[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031

[7]

[Anonymous], 2017, IEEE C COMP VIS PATT

[8]

[Anonymous], ADV NEURAL INFORM PR

[9]

[Anonymous], 2013, arXiv

[10]

[Anonymous], 2017, Learning non-maximum suppression

← 1 2 3 →