Generalization-Enhanced Few-Shot Object Detection in Remote Sensing

被引:5
作者
Lin, Hui [1 ]
Li, Nan [1 ]
Yao, Pengjuan [2 ,3 ]
Dong, Kexin [1 ]
Guo, Yuhan [4 ]
Hong, Danfeng [5 ,6 ]
Zhang, Ying [7 ]
Wen, Congcong [8 ,9 ]
机构
[1] China Acad Elect & Informat Technol, Beijing 100846, Peoples R China
[2] China Meteorol Adm, Natl Satellite Meteorol Ctr, Beijing 100081, Peoples R China
[3] China Meteorol Adm, Innovat Ctr FengYun Meteorol Satellite, Beijing 100081, Peoples R China
[4] Tsinghua Univ, Dept Hydraul Engn, State Key Lab Hydrosci & Engn, Beijing 100084, Peoples R China
[5] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[6] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[7] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing, Peoples R China
[8] New York Univ Abu Dhabi, Dept Elect & Comp Engn, Abu Dhabi, U Arab Emirates
[9] Univ Sci & Technol China, Sch Cyber Sci & Technol, Hefei 230052, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; few-shot object detection; remote sensing; few-shot learning;
D O I
10.1109/TCSVT.2025.3528262
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection is a fundamental task in computer vision that involves accurately locating and classifying objects within images or video frames. In remote sensing, this task is particularly challenging due to the high resolution, multi-scale features, and diverse ground object characteristics inherent in satellite and UAV imagery. These challenges necessitate more advanced approaches for effective object detection in such environments. While deep learning methods have achieved remarkable success in remote sensing object detection, they typically rely on large amounts of labeled data. Acquiring sufficient labeled data, particularly for novel or rare objects, is both challenging and time-consuming in remote sensing scenarios, limiting the generalization capabilities of existing models. To address these challenges, few-shot learning (FSL) has emerged as a promising approach, aiming to enable models to learn new classes from limited labeled examples. Building on this concept, few-shot object detection (FSOD) specifically targets object detection challenges in data-limited conditions. However, the generalization capability of FSOD models, particularly in remote sensing, is often constrained by the complex and diverse characteristics of the objects present in such environments. In this paper, we propose the Generalization-Enhanced Few-Shot Object Detection (GE-FSOD) model to improve the generalization capability in remote sensing FSOD tasks. Our model introduces three key innovations: the Cross-Level Fusion Pyramid Attention Network (CFPAN) for enhanced multi-scale feature representation, the Multi-Stage Refinement Region Proposal Network (MRRPN) for more accurate region proposals, and the Generalized Classification Loss (GCL) for improved classification performance in few-shot scenarios. GE-FSOD demonstrates superior robustness and accuracy in remote sensing FSOD tasks through these enhancements. Extensive experiments on the DIOR and NWPU VHR-10 datasets show that our model achieves state-of-the-art performance, significantly advancing the field of few-shot object detection in remote sensing. The source code is available at (https://github.com/leenamx/GE-FSOD).
引用
收藏
页码:5445 / 5460
页数:16
相关论文
共 73 条
[1]  
Arthaud F, 2021, Arxiv, DOI arXiv:2103.16911
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]  
Cai JH, 2020, Arxiv, DOI arXiv:2012.01784
[4]   Multiscale Object Contrastive Learning-Derived Few-Shot Object Detection in VHR Imagery [J].
Chen, Jie ;
Qin, Dengda ;
Hou, Dongyang ;
Zhang, Jun ;
Deng, Min ;
Sun, Geng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]   Multi-Level Semantic Feature Augmentation for One-Shot Learning [J].
Chen, Zitian ;
Fu, Yanwei ;
Zhang, Yinda ;
Jiang, Yu-Gang ;
Xue, Xiangyang ;
Sigal, Leonid .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) :4594-4605
[6]   Prototype-CNN for Few-Shot Object Detection in Remote Sensing Images [J].
Cheng, Gong ;
Yan, Bowei ;
Shi, Peizhen ;
Li, Ke ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Xu, Dong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) :265-278
[8]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[9]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28
[10]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132