A lightweight multi-feature fusion network for unmanned aerial vehicle infrared ray image object detection

被引:3
作者
Chen, Yunlei [1 ]
Liu, Ziyan [1 ,2 ,3 ]
Zhang, Lihui [1 ]
Wu, Yingyu [1 ]
Zhang, Qian [1 ]
Zheng, Xuhui [1 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
[2] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Guizhou, Peoples R China
[3] Guizhou Univ, Chongli Bldg, Guiyang 550025, Guizhou, Peoples R China
关键词
UAV; Infrared object detection; Multi-feature fusion; YOLOX; ShuffleNetv2; NEURAL-NETWORK;
D O I
10.1016/j.ejrs.2024.03.001
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
UAV (Unmanned Aerial Vehicle) infrared object detection is crucial in pedestrian monitoring and traffic dispatch, which detects and locates objects in infrared images. In light of issues such as unnoticeable texture features and limited resolution of infrared image objects, a lightweight multi-scale feature fusion method for UAV infrared object detection is presented to enhance the performance of UAVs carrying intelligent devices to detect infrared objects. By changing the anchorless frame strategy of the YOLOX method, a lightweight MultiFeature Fusion Network (MFFNet) for UAV infrared ray (IR) image object detection is proposed. First, a lightweight backbone network is built using ShuffleNetv2_block, spatial pyramid pooling, and other modules to reduce the network's number of parameters and inference time while maintaining its capacity to extract features. Second, we develop a multi-feature fusion module to improve the detection capabilities of the model for IR objects by fusing the local features and the overall characteristics of IR objects since the texture features of IR objects are challenging to employ, but the boundary information is evident. The boundary frame regression loss is then optimized using SCYLLA-IoU (SIoU) by comparing the predicted frame to the actual frame in terms of angle, distance, shape, and IoU (Intersection over Union), which forces the model to reach the optimum predicted box more quickly. The experimental results demonstrate that our method achieves an 81.5% mean average precision (mAP) with 4.21M parameters and an inference time of only 4.84ms per image, outperforming most networks in speed and accuracy.
引用
收藏
页码:268 / 276
页数:9
相关论文
共 31 条
[1]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[2]   Infrared Small UAV Target Detection Based on Residual Image Prediction via Global and Local Dilated Residual Networks [J].
Fang, Houzhang ;
Xia, Mingjiang ;
Zhou, Gang ;
Chang, Yi ;
Yan, Luxin .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[3]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[4]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]   Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916
[7]   An Attention Mechanism-Improved YOLOv7 Object Detection Algorithm for Hemp Duck Count Estimation [J].
Jiang, Kailin ;
Xie, Tianyu ;
Yan, Rui ;
Wen, Xi ;
Li, Danyang ;
Jiang, Hongbo ;
Jiang, Ning ;
Feng, Ling ;
Duan, Xuliang ;
Wang, Jianjun .
AGRICULTURE-BASEL, 2022, 12 (10)
[8]  
Jocher Glenn, 2022, Zenodo
[9]   ComNet: Combinational Neural Network for Object Detection in UAV-Borne Thermal Images [J].
Li, Minglei ;
Zhao, Xingke ;
Li, Jiasong ;
Nan, Liangliang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6662-6673
[10]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) :318-327