Improved Model Integrating FPN with Refined IoU for Efficient Object Detection Algorithm in Remote Sensing Satellite Images

被引：1

作者：

Abd Elhamied, Essam M. ^{[1
]}

Youssef, Sherin M. ^{[2
]}

El Shenawy, Marwa ^{[2
]}

机构：

[1] Arab Acad Sci & Technol AASTMT, Informat & Documentat Ctr, Alexandria, Egypt

[2] Arab Acad Sci & Technol AASTMT, Comp Engn Dept, Alexandria, Egypt

来源：

2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND SMART INNOVATION, ICMISI 2024 | 2024年

关键词：

YOLOv8s; Small target detection; Remote sensing images; DIOR; Sensing;

D O I：

10.1109/ICMISI61517.2024.10580024

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Common detection approaches for recognizing small targets are ineffective due to inherent constraints in remote sensing image, including noise and a lack of specific information about small targets. This paper presents a new model for improving the detection accuracy of small targets in remote sensing images. The initial modifications to the feature extraction stage and introducing an architecture attention mechanism. Furthermore, a transformer block is utilized to enhance the representation of the feature map. The discriminative information extraction is enhanced by employing a distinctive attention-guided bidirectional feature pyramid network. This is accomplished by carefully pulling properties from the superficial network using a dynamic and sparse attention technique. Furthermore, top-down pathways are used to improve feature integration into the subsequent network modules. A Rectified Intersection Over Union loss function is introduced to specifically handle the limitations of the loss function, hence enhancing the alignment between the detected and ground-truth bounding boxes in terms of maintaining consistent shapes. Empirical evaluations on the DIOR, VHR-10 and VisDrone2019 datasets provide empirical confirmation of Improved-YOLOv8s performance, with considerable increases in mean Average Precision (mAP) for small targets, overall mAP, model parameters, and Frames Per Second (FPS). The findings demonstrate the efficacy of the modifications made in our adaptation of the original YOLOv8s model. The application of these strategies significantly enhances the performance of the proposed algorithm in detecting small targets in remote sensing images. A comparative evaluation of the original YOLOv8s architecture indicates considerable improvements in recognition accuracy.

引用

页码：244 / 250

页数：7

共 29 条

[1]

Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]

[2]

Cai Z., 2018, IEEE CVF C COMP VIS, P9488

[3] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[4] Object Detection with Discriminatively Trained Part-Based Models [J].

Felzenszwalb, Pedro F. ;

Girshick, Ross B. ;

McAllester, David ;

Ramanan, Deva .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1627-1645

[5] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[6] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[7]

Hongkai Zhang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12360), P260, DOI 10.1007/978-3-030-58555-6_16

[8]

Li H., 2019, IEEE Transactions on Image Processing, V28, P3508

[9]

Lim S., 2020, IEEE CVF C COMP VIS, P10325

[10]

Lin Y., 2018, IEEE INT C COMP VIS, P439

← 1 2 3 →