Object Detection in Remote Sensing Imagery Based on Prototype Learning Network With Proposal Relation

被引:2
作者
Ni, Kang [1 ,2 ,3 ,4 ]
Ma, Tengfei [5 ,6 ]
Zheng, Zhizhong [1 ,4 ]
Wang, Peng [2 ,3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Huzhou Key Lab Urban Multidimens Percept & Intelli, Huzhou 313000, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Key Lab Radar Imaging & Microwave Photon, Minist Educ, Nanjing 211106, Peoples R China
[4] Jiangsu Prov Engn Res Ctr Airborne Detecting & Int, Nanjing 210049, Peoples R China
[5] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Peoples R China
[6] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Remote sensing; Feature extraction; Object detection; Proposals; Prototypes; Semantics; Contrastive learning; Deep learning; object detection; receptive field; remote sensing; structural relationships;
D O I
10.1109/TIM.2024.3451572
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning object detection algorithms, due to their powerful feature learning capabilities, can effectively improve the accuracy of target detection in remote sensing images. However, remote sensing image target detection faces challenges such as dense arrangement of small targets and complex backgrounds. Addressing the above issues, how to enhance the receptive field while effectively depicting the structural relationships between proposals will be beneficial for detecting small targets in remote sensing images with complex backgrounds. Motivated by this, a prototype learning network with proposal relation, called PLNet-PR, is proposed for remote sensing object detection, while enhancing receptive fields. The shift operation is inserted into the inception module and spatial graph convolution layer, constructing sparse shift selective convolution (S3Conv) based on spatial-channel selective attention mechanism, and graph-guided proposal-relation learning module (GPRLM), for enhancing the characterization of small targets and acquiring powerful proposal-level feature relations of remote sensing targets. Furthermore, a category prototype repository (CPRep) with a class-wise semantic attention (CWSA) block is proposed for the improved proposal generation between different remote sensing object categories. Our extensive experiments validate the effectiveness of PLNet-PR which outperforms other related deep learning methods. Codes are available: https://github.com/RSIP-NJUPT/PLNet-PR.
引用
收藏
页数:16
相关论文
共 85 条
  • [1] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
  • [2] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [3] Multiscale Object Contrastive Learning-Derived Few-Shot Object Detection in VHR Imagery
    Chen, Jie
    Qin, Dengda
    Hou, Dongyang
    Zhang, Jun
    Deng, Min
    Sun, Geng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Guiding Clean Features for Object Detection in Remote Sensing Images
    Cheng, Gong
    He, Min
    Hong, Hailong
    Yao, Xiwen
    Qian, Xiaoliang
    Guo, Lei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [5] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [6] Variational Prototype Learning for Deep Face Recognition
    Deng, Jiankang
    Guo, Jia
    Yang, Jing
    Lattas, Alexandros
    Zafeiriou, Stefanos
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11901 - 11910
  • [7] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [8] Multiscale Deformable Attention and Multilevel Features Aggregation for Remote Sensing Object Detection
    Dong, Xiaohu
    Qin, Yao
    Fu, Ruigang
    Gao, Yinghui
    Liu, Songlin
    Ye, Yuanxin
    Li, Biao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] Remote Sensing Object Detection Based on Receptive Field Expansion Block
    Dong, Xiaohu
    Fu, Ruigang
    Gao, Yinghui
    Qin, Yao
    Ye, Yuanxin
    Li, Biao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [10] Progressive Contextual Instance Refinement for Weakly Supervised Object Detection in Remote Sensing Images
    Feng, Xiaoxu
    Han, Junwei
    Yao, Xiwen
    Cheng, Gong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (11): : 8002 - 8012