An Improved YOLOv8-Based Lightweight Attention Mechanism for Cross-Scale Feature Fusion

被引:3
作者
Liu, Shaodong [1 ]
Shao, Faming [1 ]
Chu, Weijun [1 ]
Dai, Juying [1 ]
Zhang, Heng [1 ]
机构
[1] Army Engn Univ PLA, Coll Field Engn, Nanjing 210007, Peoples R China
基金
中国国家自然科学基金;
关键词
aerial small targets; lightweight model; attention mechanism; feature fusion; loss function; CONVOLUTION; NETWORK;
D O I
10.3390/rs17061044
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper addresses the challenge of small object detection in remote sensing image recognition by proposing an improved YOLOv8-based lightweight attention cross-scale feature fusion model named LACF-YOLO. Prior to the backbone network outputting feature maps, this model introduces a lightweight attention module, Triplet Attention, and replaces the Concatenation with Fusion (C2f) with a more convenient and higher-performing dilated inverted convolution layer to acquire richer contextual information during the feature extraction phase. Additionally, it employs convolutional blocks composed of partial convolution and pointwise convolution as the main body of the cross-scale feature fusion network to integrate feature information from different levels. The model also utilizes the faster-converging Focal EIOU loss function to enhance accuracy and efficiency. Experimental results on the DOTA and VisDrone2019 datasets demonstrate the effectiveness of the improved model. Compared to the original YOLOv8 model, LACF-YOLO achieves a 2.9% increase in mAP and a 4.6% increase in mAPS on the DOTA dataset and a 3.5% increase in mAP and a 3.8% increase in mAPS on the VisDrone2019 dataset, with a 34.9% reduction in the number of parameters and a 26.2% decrease in floating-point operations. The model exhibits superior performance in aerial object detection.
引用
收藏
页数:36
相关论文
共 64 条
[1]   How does shrimp farming impact agricultural production and food security in coastal Bangladesh? Evidence from farmer perception and remote sensing approach [J].
Ahmed, Zia ;
Ambinakudige, Shrinidhi .
OCEAN & COASTAL MANAGEMENT, 2024, 255
[2]   Multiscale Dynamic Attention and Hierarchical Spatial Aggregation for Few-Shot Object Detection [J].
An, Yining ;
Song, Chunlin .
APPLIED SCIENCES-BASEL, 2025, 15 (03)
[3]   Remote sensing of diverse urban environments: From the single city to multiple cities [J].
Chen, Gang ;
Zhou, Yuyu ;
Voogt, James A. ;
Stokes, Eleanor C. .
REMOTE SENSING OF ENVIRONMENT, 2024, 305
[4]   SemiRoadExNet: A semi-supervised network for road extraction from remote sensing imagery via adversarial learning [J].
Chen, Hao ;
Li, Zhenghong ;
Wu, Jiangjiang ;
Xiong, Wei ;
Du, Chun .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 198 :169-183
[5]   Dynamic YOLO for small underwater object detection [J].
Chen, Jie ;
Er, Meng Joo .
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)
[6]   SFA-guided mosaic transformer for tracking small objects in snapshot spectral imaging [J].
Chen, Lulu ;
Zhao, Yongqiang ;
Kong, Seong G. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 204 :223-236
[7]  
Chen XX, 2009, 2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, P555
[8]   Scale-Aware Domain Adaptive Faster R-CNN [J].
Chen, Yuhua ;
Wang, Haoran ;
Li, Wen ;
Sakaridis, Christos ;
Dai, Dengxin ;
Van Gool, Luc .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (07) :2223-2243
[9]   Big Data for Remote Sensing: Challenges and Opportunities [J].
Chi, Mingmin ;
Plaza, Antonio ;
Benediktsson, Jon Atli ;
Sun, Zhongyi ;
Shen, Jinsheng ;
Zhu, Yangyong .
PROCEEDINGS OF THE IEEE, 2016, 104 (11) :2207-2219
[10]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297