Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation

被引:11
作者
Cao, Qinglong [1 ,2 ]
Chen, Yuntian [2 ,3 ]
Ma, Chao [1 ]
Yang, Xiaokang [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[2] Eastern Inst Technol, Ningbo Inst Digital Twin, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
美国国家科学基金会;
关键词
Consistent prediction; few-shot aerial semantic segmentation; rotation invariance; rotation-adaptive matching; NETWORK;
D O I
10.1109/TGRS.2023.3338699
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot aerial image semantic segmentation is a challenging task that requires precisely parsing unseen-category objects in query aerial images with limited annotated support aerial images. Formally, category prototypes would be extracted from support samples to segment query images in a pixel-to-pixel matching manner. However, aerial objects in aerial images are often distributed with arbitrary orientations, and varying orientations could cause a dramatic feature change. This unique property of aerial images renders conventional matching manner without consideration of orientations fails to activate same-category objects with different orientations. Furthermore, the oscillation of the confidence scores in existing rotation-insensitive algorithms, engendered by the striking changes of object orientations, often leads to false recognition of lower scored rotated semantic objects. To tackle these challenges, inspired by the intrinsic rotation invariance in aerial images, we propose a novel few-shot rotation-invariant aerial semantic segmentation network (FRINet) to efficiently segment aerial semantic objects with diverse orientations. Specifically, through extracting orientation-varying yet category-consistent support information, FRINet provides rotation-adaptive matching for each query feature in a feature-aggregation manner. Meanwhile, to encourage consistent predictions for aerial objects with arbitrary orientations, segmentation predictions from different orientations are supervised by the same label and further fused to obtain the final rotation-invariant prediction in a complementary manner. Moreover, aiming at providing a better solution searching space, the backbones are newly pretrained in the base category to basically boost the segmentation performance. Extensive experiments on the few-shot aerial image semantic segmentation benchmark demonstrate that the proposed FRINet achieves a new state-of-the-art performance. The code is available at https://github.com/caoql98/FRINet.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 63 条
[1]   A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery [J].
Ayala, Christian ;
Sesma, Ruben ;
Aranda, Carlos ;
Galar, Mikel .
REMOTE SENSING, 2021, 13 (16)
[2]   Aerial image semantic segmentation using DCNN predicted distance maps [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Huang, Jingfeng .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 :309-322
[3]  
Chaudhuri B, 2018, IEEE T GEOSCI REMOTE, V56, P1144, DOI [10.1109/TGRS.2017.2760909, 10.1109/tgrs.2017.2760909]
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   Holistic Prototype Activation for Few-Shot Segmentation [J].
Cheng, Gong ;
Lang, Chunbo ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :4650-4666
[6]   SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification [J].
Cheng, Gong ;
Cai, Liming ;
Lang, Chunbo ;
Yao, Xiwen ;
Chen, Jinyong ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Prototype-CNN for Few-Shot Object Detection in Remote Sensing Images [J].
Cheng, Gong ;
Yan, Bowei ;
Shi, Peizhen ;
Li, Ke ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[8]   CCANet: Class-Constraint Coarse-to-Fine Attentional Deep Network for Subdecimeter Aerial Image Semantic Segmentation [J].
Deng, Guohui ;
Wu, Zhaocong ;
Wang, Chengjun ;
Xu, Miaozhong ;
Zhong, Yanfei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]   ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data [J].
Diakogiannis, Foivos, I ;
Waldner, Francois ;
Caccetta, Peter ;
Wu, Chen .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 :94-114