YOLO-SE: Improved YOLOv8 for Remote Sensing Object Detection and Recognition

被引：86

作者：

Wu, Tianyong ^{[1
]}

Dong, Youkou ^{[1
]}

机构：

[1] China Univ Geosci, Coll Marine Sci & Technol, Wuhan 430074, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 24期

关键词：

object detection; remote sensing images; multi-scale; loss functions;

D O I：

10.3390/app132412977

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Object detection remains a pivotal aspect of remote sensing image analysis, and recent strides in Earth observation technology coupled with convolutional neural networks (CNNs) have propelled the field forward. Despite advancements, challenges persist, especially in detecting objects across diverse scales and pinpointing small-sized targets. This paper introduces YOLO-SE, a novel YOLOv8-based network that innovatively addresses these challenges. First, the introduction of a lightweight convolution SEConv in lieu of standard convolutions reduces the network's parameter count, thereby expediting the detection process. To tackle multi-scale object detection, the paper proposes the SEF module, an enhancement based on SEConv. Second, an ingenious Efficient Multi-Scale Attention (EMA) mechanism is integrated into the network, forming the SPPFE module. This addition augments the network's feature extraction capabilities, adeptly handling challenges in multi-scale object detection. Furthermore, a dedicated prediction head for tiny object detection is incorporated, and the original detection head is replaced by a transformer prediction head. To address adverse gradients stemming from low-quality instances in the target detection training dataset, the paper introduces the Wise-IoU bounding box loss function. YOLO-SE showcases remarkable performance, achieving an average precision at IoU threshold 0.5 (AP50) of 86.5% on the optical remote sensing dataset SIMD. This represents a noteworthy 2.1% improvement over YOLOv8 and YOLO-SE outperforms the state-of-the-art model by 0.91%. In further validation, experiments on the NWPU VHR-10 dataset demonstrated YOLO-SE's superiority with an accuracy of 94.9%, surpassing that of YOLOv8 by 2.6%. The proposed advancements position YOLO-SE as a compelling solution in the realm of deep learning-based remote sensing image object detection.

引用

页数：21

共 48 条

[1]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]

[2] GCL-YOLO: A GhostConv-Based Lightweight YOLO Network for UAV Small Object Detection [J].

Cao, Jinshan ;

Bao, Wenshu ;

Shang, Haixing ;

Yuan, Ming ;

Cheng, Qian .

REMOTE SENSING, 2023, 15 (20)

[3] Consensus Techniques for Unsupervised Binary Change Detection Using Multi-Scale Segmentation Detectors for Land Cover Vegetation Images [J].

Cardama, F. Javier ;

Heras, Dora B. ;

Arguello, Francisco .

REMOTE SENSING, 2023, 15 (11)

[4]

Chen C-F, 2021, arXiv, DOI 10.48550/arXiv.2106.02689

[5]

Chen M, 2020, PR MACH LEARN RES, V119

[6]

Chu XX, 2021, ADV NEUR IN

[7] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[8]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[9]

Ge Z, 2021, Arxiv, DOI arXiv:2107.08430

[10]

Gkioxari G, 2014, Arxiv, DOI arXiv:1406.5212

← 1 2 3 4 5 →