Real-Time object detector based MobileNetV3 for UAV applications

Cited by: 4
Authors
Yang, Yonghao [1]
Han, Jin [1]
Affiliation
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266000, Peoples R China
Keywords
UAV; Object detection; Lightweight; ShuffleNetV2; MobileNetV3
DOI
10.1007/s11042-022-14196-x
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the continuous progress of UAV (unmanned aerial vehicle) flight technology, more and more outdoor vision tasks rely on UAVs, and many of them require computer vision algorithms to analyze the information captured by the camera. However, deploying detectors on embedded devices is difficult because of the trade-offs among energy consumption, accuracy, and speed. In this paper, we propose an end-to-end object detection model that runs on a UAV platform and is suitable for real-time applications. Building on a study of ShuffleNetV2 and MobileNetV3, we propose a new feature extraction network structure. To improve detection accuracy without sacrificing detection efficiency, we add a multi-scale fusion module based on deconvolution. Experiments show that, when deployed on our onboard Nvidia Jetson TX2 for testing and inference, the model, combined with a modified focal loss function, achieves 21.7% mAP for object detection at an inference speed of 17 fps.
Pages: 18709-18725
Page count: 17