Real-Time object detector based MobileNetV3 for UAV applications

被引：4

作者：

Yang, Yonghao ^{[1
]}

Han, Jin ^{[1
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266000, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 82卷 / 12期

关键词：

UAV; Object detection; Lightweight; ShufflenetV2; MobileNetV3;

D O I：

10.1007/s11042-022-14196-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the continuous progress of UAV (unmanned aerial vehicle) flight technology, more and more outdoor vision tasks begin to rely on UAV to complete, many of which require computer vision algorithms to analyze the information captured by the camera. However, it is difficult to deploy detectors on embedded devices due to the challenges among energy consumption, accuracy, and speed. In this paper, we propose an end-to-end object detection model running on a UAV platform that is suitable for real-time applications. Through the research of shufflenetv2 and mobilenetv3, a new feature extraction network structure is proposed. In order to improve the detection accuracy without losing the detection efficiency, a multi-scale fusion module based on deconvolution is added. Experiments show when deployed on our onboard Nvidia Jetson TX2 for testing and inference, our model combined with a modified focal loss function, produced a desirable performance of 21.7% mAP for object detection with an inference time of 17 fps.

引用

页码：18709 / 18725

页数：17

共 29 条

[11] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
[12] ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
Ma, Ningning
Zhang, Xiangyu
Zheng, Hai-Tao
Sun, Jian
[J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 122 - 138
[13] Large Kernel Matters - Improve Semantic Segmentation by Global Convolutional Network
Peng, Chao
Zhang, Xiangyu
Yu, Gang
Luo, Guiming
Sun, Jian
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1743 - 1751
[14] Redmon J, 2018, ARXIV
[15] Redmon J, 2018, Arxiv, DOI [arXiv:1804.02767, DOI 10.48550/ARXIV.1804.02767, DOI 10.1109/CVPR.2017.690]
[16] You Only Look Once: Unified, Real-Time Object Detection
Redmon, Joseph
Divvala, Santosh
Girshick, Ross
Farhadi, Ali
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 779 - 788
[17] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Ren, Shaoqing
He, Kaiming
Girshick, Ross
Sun, Jian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
[18] MobileNetV2: Inverted Residuals and Linear Bottlenecks
Sandler, Mark
Howard, Andrew
Zhu, Menglong
Zhmoginov, Andrey
Chen, Liang-Chieh
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4510 - 4520
[19] Sermanet Pierre, 2013, arXiv
[20] Singh PP, 2019, INT C COMP VIS IM PR, ppp373

← 1 2 3 →