PHSI-RTDETR: A Lightweight Infrared Small Target Detection Algorithm Based on UAV Aerial Photography

被引：29

作者：

Wang, Sen ^{[1
,2
]}

Jiang, Huiping ^{[1
,2
]}

Li, Zhongjie ^{[1
,2
]}

Yang, Jixiang ^{[1
,2
]}

Ma, Xuan ^{[1
,2
]}

Chen, Jiamin ^{[1
,2
]}

Tang, Xingqun ^{[1
,2
]}

机构：

[1] Governance MOE, Key Lab Ethn Language Intelligent Anal & Secur, Beijing 100081, Peoples R China

[2] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 06期

基金：

中国国家自然科学基金;

关键词：

small infrared target; UAV; RT-DETR; lightweight structure; partial convolution; HiLo attention; slimneck; Inner-GIoU;

D O I：

10.3390/drones8060240

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

To address the issues of low model accuracy caused by complex ground environments and uneven target scales and high computational complexity in unmanned aerial vehicle (UAV) aerial infrared image target detection, this study proposes a lightweight UAV aerial infrared small target detection algorithm called PHSI-RTDETR. Initially, an improved backbone feature extraction network is designed using the lightweight RPConv-Block module proposed in this paper, which effectively captures small target features, significantly reducing the model complexity and computational burden while improving accuracy. Subsequently, the HiLo attention mechanism is combined with an intra-scale feature interaction module to form an AIFI-HiLo module, which is integrated into a hybrid encoder to enhance the focus of the model on dense targets, reducing the rates of missed and false detections. Moreover, the slimneck-SSFF architecture is introduced as the cross-scale feature fusion architecture of the model, utilizing GSConv and VoVGSCSP modules to enhance adaptability to infrared targets of various scales, producing more semantic information while reducing network computations. Finally, the original GIoU loss is replaced with the Inner-GIoU loss, which uses a scaling factor to control auxiliary bounding boxes to speed up convergence and improve detection accuracy for small targets. The experimental results show that, compared to RT-DETR, PHSI-RTDETR reduces model parameters by 30.55% and floating-point operations by 17.10%. Moreover, detection precision and speed are increased by 3.81% and 13.39%, respectively, and mAP50, impressively, reaches 82.58%, demonstrating the great potential of this model for drone infrared small target detection.

引用

页数：21

共 48 条

[1] Derivative Entropy-Based Contrast Measure for Infrared Small-Target Detection [J].

Bai, Xiangzhi ;

Bi, Yanguang .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (04) :2452-2466

[2] Analysis of new top-hat transformation and the application for infrared dim small target detection [J].

Bai, Xiangzhi ;

Zhou, Fugen .

PATTERN RECOGNITION, 2010, 43 (06) :2145-2156

[3] The Use of UAVs in Humanitarian Relief: An Application of POMDP-Based Methodology for Finding Victims [J].

Bittencourt Bravo, Raissa Zurli ;

Leiras, Adriana ;

Cyrino Oliveira, Fernando Luiz .

PRODUCTION AND OPERATIONS MANAGEMENT, 2019, 28 (02) :421-440

[4] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[5] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].

Chen, Jierun ;

Kao, Shiu-Hong ;

He, Hao ;

Zhuo, Weipeng ;

Wen, Song ;

Lee, Chul-Ho ;

Chan, S. -H. Gary .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031

[6] A lightweight multi-feature fusion network for unmanned aerial vehicle infrared ray image object detection [J].

Chen, Yunlei ;

Liu, Ziyan ;

Zhang, Lihui ;

Wu, Yingyu ;

Zhang, Qian ;

Zheng, Xuhui .

EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2024, 27 (02) :268-276

[7] Max-Mean and Max-Median filters for detection of small-targets [J].

Deshpande, SD ;

Er, MH ;

Ronda, V ;

Chan, P .

SIGNAL AND DATA PROCESSING OF SMALL TARGETS 1999, 1999, 3809 :74-83

[8] RepVGG: Making VGG-style ConvNets Great Again [J].

Ding, Xiaohan ;

Zhang, Xiangyu ;

Ma, Ningning ;

Han, Jungong ;

Ding, Guiguang ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737

[9]

Gevorgyan Z, 2022, Arxiv, DOI arXiv:2205.12740

[10] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

← 1 2 3 4 5 →