YOLO-SE: Improved YOLOv8 for Remote Sensing Object Detection and Recognition

Cited by: 86
Authors
Wu, Tianyong [1]
Dong, Youkou [1]
Affiliations
[1] China Univ Geosci, Coll Marine Sci & Technol, Wuhan 430074, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 24
Keywords
object detection; remote sensing images; multi-scale; loss functions
DOI
10.3390/app132412977
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Object detection remains a pivotal aspect of remote sensing image analysis, and recent strides in Earth observation technology coupled with convolutional neural networks (CNNs) have propelled the field forward. Despite these advancements, challenges persist, especially in detecting objects across diverse scales and pinpointing small-sized targets. This paper introduces YOLO-SE, a novel YOLOv8-based network that addresses these challenges. First, the introduction of a lightweight convolution SEConv in lieu of standard convolutions reduces the network's parameter count, thereby expediting the detection process. To tackle multi-scale object detection, the paper proposes the SEF module, an enhancement based on SEConv. Second, an Efficient Multi-Scale Attention (EMA) mechanism is integrated into the network, forming the SPPFE module. This addition augments the network's feature extraction capabilities, adeptly handling challenges in multi-scale object detection. Furthermore, a dedicated prediction head for tiny object detection is incorporated, and the original detection head is replaced by a transformer prediction head. To address adverse gradients stemming from low-quality instances in the target detection training dataset, the paper introduces the Wise-IoU bounding box loss function. YOLO-SE achieves an average precision at IoU threshold 0.5 (AP50) of 86.5% on the optical remote sensing dataset SIMD. This represents a 2.1% improvement over YOLOv8, and YOLO-SE outperforms the state-of-the-art model by 0.91%. In further validation, experiments on the NWPU VHR-10 dataset demonstrated YOLO-SE's superiority with an accuracy of 94.9%, surpassing that of YOLOv8 by 2.6%. The proposed advancements position YOLO-SE as a compelling solution in the realm of deep learning-based remote sensing image object detection.
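The AP50 figures quoted in the abstract rest on an Intersection over Union (IoU) test at a threshold of 0.5. The following is a minimal sketch of that test only, not the paper's evaluation code or its Wise-IoU loss. The function names `iou` and `is_match`, and the `(x1, y1, x2, y2)` corner format for boxes, are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2) corners (assumed format)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap rectangle, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union = sum of both areas minus the overlap counted twice.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: a prediction counts as a hit at AP50 when IoU >= 0.5."""
    return iou(pred_box, gt_box) >= threshold
```

Under these assumptions, a predicted box shifted by 2 pixels against a 10 x 10 ground-truth box still matches (IoU of 80/120), while one shifted by 5 pixels does not (IoU of 50/150).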
Pages: 21