YOLO-DRS: A Bioinspired Object Detection Algorithm for Remote Sensing Images Incorporating a Multi-Scale Efficient Lightweight Attention Mechanism

Cited by: 8

Authors:

Liao, Huan [1]

Zhu, Wenqiu [1]

Affiliations:

[1] Hunan Univ Technol, Sch Comp Sci, Zhuzhou 412007, Peoples R China

Source:

BIOMIMETICS | 2023 / Vol. 8 / Issue 06

Keywords:

bioinspired object detection; YOLOv5; multi-scale; attention mechanisms; transposed convolution;

DOI:

10.3390/biomimetics8060458

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

Bioinspired object detection in remotely sensed images plays an important role in a variety of fields. Because targets are small, background information is complex, and remote sensing images are multi-scale, the general-purpose YOLOv5 detection framework cannot obtain good detection results. To address this issue, we propose YOLO-DRS, a bioinspired object detection algorithm for remote sensing images that incorporates a multi-scale efficient lightweight attention mechanism. First, we propose LEC, a lightweight multi-scale module for efficient attention mechanisms. By fusing multi-scale feature information, the LEC module improves the model's ability to extract multi-scale targets and to recognize more targets. Then, we propose transposed convolutional upsampling as an alternative to the original nearest-neighbor interpolation algorithm. Because it learns the feature information dynamically, transposed convolutional upsampling has the potential to greatly reduce the loss of feature information, thereby reducing problems such as missed detections and false detections of small targets. The proposed YOLO-DRS algorithm exhibits significant improvements over the original YOLOv5s. Specifically, it achieves a 2.3% increase in precision (P), a 3.2% increase in recall (R), and a 2.5% increase in mAP@0.5. Notably, introducing the LEC module and transposed convolution yields improvements of 2.2% and 2.1% in mAP@0.5, respectively. In addition, YOLO-DRS increases the GFLOPs by only 0.2. Compared with the state-of-the-art algorithms YOLOv8s and YOLOv7-tiny, YOLO-DRS demonstrates significant improvements in mAP@0.5, with enhancements ranging from 1.8% to 7.3%. These results indicate that YOLO-DRS can reduce missed and false detections in remote sensing target detection.
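The contrast the abstract draws between nearest-neighbor interpolation and transposed convolutional upsampling can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the function names, the single-channel 2x2 feature map, and the "learned" kernel values are all hypothetical, chosen only to show the mechanism. With a fixed 2x2 kernel of ones and stride 2, a transposed convolution reproduces nearest-neighbor 2x upsampling exactly; once the kernel weights are trainable, the same operation can produce other upsampled patterns.

```python
def nearest_upsample_2x(x):
    """Nearest-neighbor 2x upsampling: each value is copied into a 2x2 block."""
    out = []
    for row in x:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def transposed_conv2d(x, kernel, stride):
    """Single-channel transposed convolution without padding.

    Each input value scatters a scaled copy of `kernel` into the output,
    shifted by `stride`; overlapping contributions are summed.
    """
    h, w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    out_h = (h - 1) * stride + kh
    out_w = (w - 1) * stride + kw
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(h):
        for j in range(w):
            for a in range(kh):
                for b in range(kw):
                    out[i * stride + a][j * stride + b] += x[i][j] * kernel[a][b]
    return out


# Hypothetical 2x2 feature map (assumption, for illustration only).
feature = [[1.0, 2.0], [3.0, 4.0]]

# A fixed all-ones kernel makes the transposed convolution equal to
# nearest-neighbor upsampling.
ones_kernel = [[1.0, 1.0], [1.0, 1.0]]

# Hypothetical "learned" weights (assumption): in training these would be
# updated by gradient descent rather than set by hand.
learned_kernel = [[1.0, 0.5], [0.5, 0.25]]

nearest = nearest_upsample_2x(feature)
tconv_ones = transposed_conv2d(feature, ones_kernel, stride=2)
tconv_learned = transposed_conv2d(feature, learned_kernel, stride=2)
```

In this sketch the kernel size equals the stride, so the scattered copies never overlap; other kernel sizes would sum overlapping contributions, which the function also handles.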

Pages: 18
