Transformation-Invariant Network for Few-Shot Object Detection in Remote-Sensing Images

被引:26
作者
Liu, Nanqing [1 ]
Xu, Xun [1 ,2 ]
Celik, Turgay [1 ,3 ]
Gan, Zongxin [1 ]
Li, Heng-Chao [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] ASTAR, I2R, Singapore 138632, Singapore
[3] Univ Witwatersrand, Sch Elect & Informat Engn, ZA-2000 Johannesburg, South Africa
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Object detection; Training; Remote sensing; Feature extraction; Task analysis; Metalearning; Airplanes; Few-shot learning; meta-learning; object detection; remote-sensing images (RSIs); transformation invariance;
D O I
10.1109/TGRS.2023.3332652
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection in remote-sensing images (RSIs) relies on a large amount of labeled data for training. However, the increasing number of new categories and class imbalance make exhaustive annotation impractical. Few-shot object detection (FSOD) addresses this issue by leveraging meta-learning on seen base classes and fine-tuning on novel classes with limited labeled samples. Nonetheless, the substantial scale and orientation variations of objects in RSIs pose significant challenges to existing FSOD methods. To overcome these challenges, we propose integrating a feature pyramid network (FPN) and utilizing prototype features to enhance query features, thereby improving existing FSOD methods. We refer to this modified FSOD approach as a Strong Baseline, which has demonstrated significant performance improvements compared to the original baselines. Furthermore, we tackle the issue of spatial misalignment caused by orientation variations between the query and support images by introducing a transformation-invariant network (TINet). TINet ensures geometric invariance and explicitly aligns the features of the query and support branches, resulting in additional performance gains while maintaining the same inference speed as the Strong Baseline. Extensive experiments on three widely used remote-sensing object detection datasets, that is, NWPU VHR-10.v2, DIOR, and HRRSD demonstrated the effectiveness of the proposed method.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 56 条
[1]   Zero-Shot Object Detection [J].
Bansal, Ankan ;
Sikka, Karan ;
Sharma, Gaurav ;
Chellappa, Rama ;
Divakaran, Ajay .
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :397-414
[2]   RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation Based on Visual Foundation Model [J].
Chen, Keyan ;
Liu, Chenyang ;
Chen, Hao ;
Zhang, Haotian ;
Li, Wenyuan ;
Zou, Zhengxia ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-17
[3]   OvarNet: Towards Open-vocabulary Object Attribute Recognition [J].
Chen, Keyan ;
Jiang, Xiaolong ;
Hu, Yao ;
Tang, Xu ;
Gao, Yan ;
Chen, Jianqi ;
Xie, Weidi .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :23518-23527
[4]  
Chen KY, 2023, IEEE T GEOSCI REMOTE, V61, DOI [10.1109/TGRS.2023.3272473, 10.1109/TGRS.2023.3283435]
[5]   Building Extraction from Remote Sensing Images with Sparse Token Transformers [J].
Chen, Keyan ;
Zou, Zhengxia ;
Shi, Zhenwei .
REMOTE SENSING, 2021, 13 (21)
[6]  
Cheng G., 2022, IEEE Trans. Geosci. Remote Sens., V60
[7]   Holistic Prototype Activation for Few-Shot Segmentation [J].
Cheng, Gong ;
Lang, Chunbo ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :4650-4666
[8]   SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification [J].
Cheng, Gong ;
Cai, Liming ;
Lang, Chunbo ;
Yao, Xiwen ;
Chen, Jinyong ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[9]   Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images [J].
Cheng, Gong ;
Si, Yongjie ;
Hong, Hailong ;
Yao, Xiwen ;
Guo, Lei .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) :431-435
[10]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415