Remote Sensing Object Detection Based on Strong Feature Extraction and Prescreening Network

被引:32
作者
Li, Mengyuan [1 ]
Cao, Changqing [1 ]
Feng, Zhejun [1 ]
Xu, Xiangkai [1 ]
Wu, Zengyan [1 ]
Ye, Shubing [1 ]
Yong, Jiawei [1 ]
机构
[1] Xidian Univ, Sch Optoelect Engn, Xian 710071, Peoples R China
关键词
Feature extraction; Remote sensing; Object detection; Decoding; Predictive models; Deformable models; Data mining; A-D loss function; deformable end-to-end object detection with transformers (DETR); high-resolution remote sensing images (HRRSIs); multiscale deformable prescreening attention (MSDPA); multiscale split-attention (MSSA); small objects;
D O I
10.1109/LGRS.2023.3236777
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Remote sensing object detection has been an important and challenging research hot spot in computer vision that is widely used in military and civilian fields. Recently, the combined detection model of convolutional neural network (CNN) and transformer has achieved good results, but the problem of poor detection performance of small objects still needs to be solved urgently. This letter proposes a deformable end-to-end object detection with transformers (DETR)-based framework for object detection in remote sensing images. First, multiscale split attention (MSSA) is designed to extract more detailed feature information by grouping. Next, we propose multiscale deformable prescreening attention (MSDPA) mechanism in decoding layer, which achieves the purpose of prescreening, so that the encoder-decoder structure can obtain attention map more efficiently. Finally, the A-D loss function is applied to the prediction layer, increasing the attention of small objects and optimizing the intersection over union (IOU) function. We conduct extensive experiments on the DOTA v1.5 dataset and the HRRSD dataset, which show that the reconstructed detection model is more suitable for remote sensing objects, especially for small objects. The average detection accuracy in DOTA dataset has improved by 4.4% (up to 75.6%), especially the accuracy of small objects has raised by 5%.
引用
收藏
页数:5
相关论文
共 25 条
[1]   VQA: Visual Question Answering [J].
Agrawal, Aishwarya ;
Lu, Jiasen ;
Antol, Stanislaw ;
Mitchell, Margaret ;
Zitnick, C. Lawrence ;
Parikh, Devi ;
Batra, Dhruv .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (01) :4-31
[2]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[3]  
Carion N, 2020, Arxiv, DOI [arXiv:2005.12872, DOI 10.48550/ARXIV.2005.12872]
[4]   Remote Sensing Object Tracking With Deep Reinforcement Learning Under Occlusion [J].
Cui, Yanyu ;
Hou, Biao ;
Wu, Qian ;
Ren, Bo ;
Wang, Shuang ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]   A Multi-Scale Spatial Attention Region Proposal Network for High-Resolution Optical Remote Sensing Imagery [J].
Dong, Ruchan ;
Jiao, Licheng ;
Zhang, Yan ;
Zhao, Jin ;
Shen, Weiyan .
REMOTE SENSING, 2021, 13 (17)
[6]   Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning [J].
Han, Junwei ;
Zhang, Dingwen ;
Cheng, Gong ;
Guo, Lei ;
Ren, Jinchang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (06) :3325-3337
[7]   Deep networks under scene-level supervision for multi-class geospatial object detection from remote sensing images [J].
Li, Yansheng ;
Zhang, Yongjun ;
Huang, Xin ;
Yuille, Alan L. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 146 :182-196
[8]  
Lin TY, 2017, Arxiv, DOI [arXiv:1612.03144, DOI 10.48550/ARXIV.1612.03144, 10.48550/ARXIV.1612.03144, 10.48550/arxiv.1612.03144]
[9]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) :318-327
[10]  
Meng DP, 2023, Arxiv, DOI arXiv:2108.06152