CNTR-YOLO: Improved YOLOv5 Based on ConvNext and Transformer for Aircraft Detection in Remote Sensing Images

被引:10
作者
Zhou, Fengyun [1 ]
Deng, Honggui [1 ]
Xu, Qiguo [1 ]
Lan, Xin [1 ]
机构
[1] Cent South Univ, Sch Phys & Elect, Lushan South Rd, Changsha 410083, Peoples R China
关键词
remote sensing images; aircraft detection; YOLOv5; ConvNext; Transformer; OBJECT DETECTION;
D O I
10.3390/electronics12122671
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aircraft detection in remote sensing images is an important branch of target detection due to the military value of aircraft. However, the diverse categories of aircraft and the intricate background of remote sensing images often lead to insufficient detection accuracy. Here, we present the CNTR-YOLO algorithm based on YOLOv5 as a solution to this issue. The CNTR-YOLO algorithm improves detection accuracy through three primary strategies. (1) We deploy DenseNet in the backbone to address the vanishing gradient problem during training and enhance the extraction of fundamental information. (2) The CBAM attention mechanism is integrated into the neck to minimize background noise interference. (3) The C3CNTR module is designed based on ConvNext and Transformer to clarify the target's position in the feature map from both local and global perspectives. This module is applied before the prediction head to optimize the accuracy of prediction results. Our proposed algorithm is validated on the MAR20 and DOTA datasets. The results on the MAR20 dataset show that the mean average precision (mAP) of CNTR-YOLO reached 70.1%, which is a 3.3% improvement compared with YOLOv5l. On the DOTA dataset, the results indicate that the mAP of CNTR-YOLO reached 63.7%, which is 2.5% higher than YOLOv5l.
引用
收藏
页数:18
相关论文
共 35 条
[1]   Object based image analysis for remote sensing [J].
Blaschke, T. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2010, 65 (01) :2-16
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[4]   Research on Airplane and Ship Detection of Aerial Remote Sensing Images Based on Convolutional Neural Network [J].
Cao, Changqing ;
Wu, Jin ;
Zeng, Xiaodong ;
Feng, Zhejun ;
Wang, Ting ;
Yan, Xu ;
Wu, Zengyan ;
Wu, Qifan ;
Huang, Ziqiang .
SENSORS, 2020, 20 (17) :1-16
[5]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28
[6]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269