Generalization-Enhanced Few-Shot Object Detection in Remote Sensing

被引:4
作者
Lin, Hui [1 ]
Li, Nan [1 ]
Yao, Pengjuan [2 ,3 ]
Dong, Kexin [1 ]
Guo, Yuhan [4 ]
Hong, Danfeng [5 ,6 ]
Zhang, Ying [7 ]
Wen, Congcong [8 ,9 ]
机构
[1] China Acad Elect & Informat Technol, Beijing 100846, Peoples R China
[2] China Meteorol Adm, Natl Satellite Meteorol Ctr, Beijing 100081, Peoples R China
[3] China Meteorol Adm, Innovat Ctr FengYun Meteorol Satellite, Beijing 100081, Peoples R China
[4] Tsinghua Univ, Dept Hydraul Engn, State Key Lab Hydrosci & Engn, Beijing 100084, Peoples R China
[5] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[6] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[7] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing, Peoples R China
[8] New York Univ Abu Dhabi, Dept Elect & Comp Engn, Abu Dhabi, U Arab Emirates
[9] Univ Sci & Technol China, Sch Cyber Sci & Technol, Hefei 230052, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; few-shot object detection; remote sensing; few-shot learning;
D O I
10.1109/TCSVT.2025.3528262
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection is a fundamental task in computer vision that involves accurately locating and classifying objects within images or video frames. In remote sensing, this task is particularly challenging due to the high resolution, multi-scale features, and diverse ground object characteristics inherent in satellite and UAV imagery. These challenges necessitate more advanced approaches for effective object detection in such environments. While deep learning methods have achieved remarkable success in remote sensing object detection, they typically rely on large amounts of labeled data. Acquiring sufficient labeled data, particularly for novel or rare objects, is both challenging and time-consuming in remote sensing scenarios, limiting the generalization capabilities of existing models. To address these challenges, few-shot learning (FSL) has emerged as a promising approach, aiming to enable models to learn new classes from limited labeled examples. Building on this concept, few-shot object detection (FSOD) specifically targets object detection challenges in data-limited conditions. However, the generalization capability of FSOD models, particularly in remote sensing, is often constrained by the complex and diverse characteristics of the objects present in such environments. In this paper, we propose the Generalization-Enhanced Few-Shot Object Detection (GE-FSOD) model to improve the generalization capability in remote sensing FSOD tasks. Our model introduces three key innovations: the Cross-Level Fusion Pyramid Attention Network (CFPAN) for enhanced multi-scale feature representation, the Multi-Stage Refinement Region Proposal Network (MRRPN) for more accurate region proposals, and the Generalized Classification Loss (GCL) for improved classification performance in few-shot scenarios. GE-FSOD demonstrates superior robustness and accuracy in remote sensing FSOD tasks through these enhancements. Extensive experiments on the DIOR and NWPU VHR-10 datasets show that our model achieves state-of-the-art performance, significantly advancing the field of few-shot object detection in remote sensing. The source code is available at (https://github.com/leenamx/GE-FSOD).
引用
收藏
页码:5445 / 5460
页数:16
相关论文
共 73 条
[31]   Object detection in optical remote sensing images: A survey and a new benchmark [J].
Li, Ke ;
Wan, Gang ;
Cheng, Gong ;
Meng, Liqiu ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 (159) :296-307
[32]   Synchrosqueezing Polynomial Chirplet Transform and Its Application in Tight Sandstone Gas Reservoir Identification [J].
Li, Rui ;
Chen, Hui ;
Fang, Yuxia ;
Hu, Ying ;
Chen, Xuping ;
Li, Jun .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[33]  
Li Xiang, 2023, International Journal of Applied Earth Observation and Geoinformation, V124, DOI 10.1016/j.jag.2023.103497
[34]   Vision-Language Models in Remote Sensing: Current progress and future trends [J].
Li, Xiang ;
Wen, Congcong ;
Hu, Yuan ;
Yuan, Zhenghang ;
Zhu, Xiao Xiang .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2024, 12 (02) :32-66
[35]  
Li X, 2022, IEEE T GEOSCI REMOTE, V60, DOI [10.1109/TGRS.2021.3051383, 10.1109/TGRS.2022.3201206]
[36]  
Li ZG, 2017, Arxiv, DOI [arXiv:1707.09835, DOI 10.48550/ARXIV.1707.09835]
[37]  
Lin H, 2025, Arxiv, DOI arXiv:2411.01595
[38]   Few-Shot Object Detection in Remote-Sensing Images via Label-Consistent Classifier and Gradual Regression [J].
Liu, Yanxing ;
Pan, Zongxu ;
Yang, Jianwei ;
Zhang, Bingchen ;
Zhou, Guangyao ;
Hu, Yuxin ;
Ye, Qixiang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-14
[39]   Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks [J].
Long, Yang ;
Gong, Yiping ;
Xiao, Zhifeng ;
Liu, Qing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (05) :2486-2498
[40]   A Metric-Based Few-Shot Learning Method for Fish Species Identification with Limited Samples [J].
Lu, Jiamin ;
Zhang, Song ;
Zhao, Shili ;
Li, Daoliang ;
Zhao, Ran .
ANIMALS, 2024, 14 (05)