Multiscale leapfrog structure: An efficient object detector architecture designed for unmanned aerial vehicles

Cited by: 6
Authors
Gong, Lixiong [1]
Huang, Xiao [1]
Chen, Jialin [1]
Xiao, Miaoling [1]
Chao, Yinkang [1]
Affiliation
[1] Hubei Univ Technol, Sch Mech Engn, Wuhan, Peoples R China
Keywords
Reparameterization; UAV object detection; Self-attention; Variability boundary activation function; Networks
DOI
10.1016/j.engappai.2023.107270
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Convolutional neural networks (CNNs) have achieved remarkable performance in various computer vision tasks, including object detection. However, object detection from unmanned aerial vehicles (UAVs) remains difficult: backgrounds are complex, onboard resources are limited, information about edges and small objects is easily lost, and computational costs are high. To address these issues, we propose a CNN building block called reparameterized fusion convolution (RFConv), which incorporates multiscale convolution branches to capture small-object information and expand the receptive field. During inference, reparameterization reduces the computational overhead by pruning. Moreover, we observe that convolution and self-attention, two powerful techniques, follow different design paradigms, yet many of their computations are actually accomplished through similar operations. Consequently, we propose a hybrid module that enables parameter reuse and information flow between convolution and self-attention, harnessing the advantages of both methods at minimal computational cost for detecting edges and small objects. Based on different combinations of the hybrid module and RFConv, we design a diverse multiscale leapfrog structure (MLS) to satisfy various usage requirements. Additionally, we propose a variability boundary activation function (VB) that reuses network information to adaptively adjust its nonlinearity and gradient characteristics, effectively addressing the distinct activation function requirements of convolution and self-attention. We incorporated the proposed method into YOLOv5s, achieving 95.46 and 27.3 AP0.5 on the UAV datasets NWPU VHR-10 and VisDrone2019, respectively, and into YOLOv5m, obtaining 92.58 mAP on the general dataset PASCAL VOC, which demonstrates that the method can enhance the effectiveness of existing detectors.
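The abstract does not specify how RFConv performs its reparameterization, so the following is only a minimal sketch of the general idea behind structural reparameterization, assuming the common kernel-merging form in which parallel convolution branches of different sizes are folded into one kernel for inference. The function names (`conv2d_same`, `merge_branches`), the single-channel setup, and the choice of a 3x3 and a 1x1 branch are all illustrative assumptions, not the authors' implementation.

```python
import random

def conv2d_same(x, k):
    """Naive single-channel 2D cross-correlation, zero-padded to keep the input size."""
    h, w = len(x), len(x[0])
    kh, kw = len(k), len(k[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    r, c = i + a - ph, j + b - pw
                    if 0 <= r < h and 0 <= c < w:  # pixels outside the image count as zero
                        acc += x[r][c] * k[a][b]
            out[i][j] = acc
    return out

def merge_branches(k3, k1):
    """Fold a 1x1 kernel into a 3x3 kernel by adding it to the centre weight (hypothetical helper)."""
    merged = [row[:] for row in k3]
    merged[1][1] += k1[0][0]
    return merged

random.seed(0)
x = [[random.gauss(0, 1) for _ in range(8)] for _ in range(8)]   # toy 8x8 feature map
k3 = [[random.gauss(0, 1) for _ in range(3)] for _ in range(3)]  # 3x3 branch
k1 = [[random.gauss(0, 1)]]                                      # 1x1 branch

# Training-time form: two parallel branches whose outputs are summed.
y3, y1 = conv2d_same(x, k3), conv2d_same(x, k1)
two_branch = [[y3[i][j] + y1[i][j] for j in range(8)] for i in range(8)]

# Inference-time form: one convolution with the merged kernel.
single = conv2d_same(x, merge_branches(k3, k1))

max_diff = max(abs(two_branch[i][j] - single[i][j]) for i in range(8) for j in range(8))
```

Because convolution is linear in its kernel, the two-branch and merged forms agree up to floating-point rounding, so `max_diff` should be close to zero while the merged form needs only one pass over the input.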
Pages: 16