Real-Time object detector based MobileNetV3 for UAV applications

Cited by: 4
Authors
Yang, Yonghao [1]
Han, Jin [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266000, Peoples R China
Keywords
UAV; Object detection; Lightweight; ShuffleNetV2; MobileNetV3
DOI
10.1007/s11042-022-14196-x
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the continuing progress of UAV (unmanned aerial vehicle) flight technology, a growing number of outdoor vision tasks rely on UAVs, and many of them require computer vision algorithms to analyze the information captured by the camera. However, deploying detectors on embedded devices is difficult because of the trade-offs among energy consumption, accuracy, and speed. In this paper, we propose an end-to-end object detection model that runs on a UAV platform and is suitable for real-time applications. Drawing on ShuffleNetV2 and MobileNetV3, we propose a new feature extraction network structure. To improve detection accuracy without sacrificing detection efficiency, we add a multi-scale fusion module based on deconvolution. Experiments show that, when deployed on our onboard Nvidia Jetson TX2 for testing and inference, our model, combined with a modified focal loss function, achieves 21.7% mAP for object detection at an inference speed of 17 fps.
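The abstract does not say how the focal loss is modified, so the following is only a minimal sketch of the standard binary focal loss, FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t), which such a variant presumably starts from. The function names, the defaults alpha=0.25 and gamma=2.0, and the clamping constant eps are illustrative assumptions, not values taken from the paper.

```python
import math


def cross_entropy(p, y, eps=1e-7):
    """Plain binary cross-entropy for one prediction (baseline for comparison)."""
    p = min(max(p, eps), 1.0 - eps)      # clamp to avoid log(0)
    p_t = p if y == 1 else 1.0 - p       # probability assigned to the true class
    return -math.log(p_t)


def focal_loss(p, y, alpha=0.25, gamma=2.0, eps=1e-7):
    """Standard binary focal loss for one prediction.

    NOTE: this is the unmodified textbook form, not the paper's variant.
    p is the predicted probability of the positive class, y is 0 or 1.
    """
    p = min(max(p, eps), 1.0 - eps)
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha   # class-balancing weight
    # (1 - p_t) ** gamma shrinks the loss of well-classified examples
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    for p in (0.9, 0.5, 0.1):
        print(f"p={p}: CE={cross_entropy(p, 1):.4f}  FL={focal_loss(p, 1):.4f}")
```

With gamma set to 0 the modulating factor disappears and the loss reduces to alpha-weighted cross-entropy, which is one way to sanity-check an implementation before changing the formula.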
Pages: 18709-18725
Number of pages: 17