Class-Aware Self- and Cross-Attention Network for Few-Shot Semantic Segmentation of Remote Sensing Images

被引:1
作者
Liang, Guozhen [1 ]
Xie, Fengxi [1 ]
Chien, Ying-Ren [2 ]
机构
[1] Tech Univ Berlin, Dept Elect Engn & Comp Sci, D-10623 Berlin, Germany
[2] Natl Ilan Univ, Dept Elect Engn, Yilan 260007, Taiwan
关键词
few-shot learning; few-shot semantic segmentation; remote sensing; class-aware self- and cross-attention;
D O I
10.3390/math12172761
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Few-Shot Semantic Segmentation (FSS) has drawn massive attention recently due to its remarkable ability to segment novel-class objects given only a handful of support samples. However, current FSS methods mainly focus on natural images and pay little attention to more practical and challenging scenarios, e.g., remote sensing image segmentation. In the field of remote sensing image analysis, the characteristics of remote sensing images, like complex backgrounds and tiny foreground objects, make novel-class segmentation challenging. To cope with these obstacles, we propose a Class-Aware Self- and Cross-Attention Network (CSCANet) for FSS in remote sensing imagery, consisting of a lightweight self-attention module and a supervised prior-guided cross-attention module. Concretely, the self-attention module abstracts robust unseen-class information from support features, while the cross-attention module generates a superior quality query attention map for directing the network to focus on novel objects. Experiments demonstrate that our CSCANet achieves outstanding performance on the standard remote sensing FSS benchmark iSAID-5i, surpassing the existing state-of-the-art FSS models across all combinations of backbone networks and K-shot settings.
引用
收藏
页数:14
相关论文
共 45 条
[1]  
Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45
[2]  
Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379
[3]  
Cheng Ouyang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12374), P762, DOI 10.1007/978-3-030-58526-6_45
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]  
Finn C, 2017, PR MACH LEARN RES, V70
[6]  
Haochen Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12358), P730, DOI 10.1007/978-3-030-58601-0_43
[7]   CCNet: Criss-Cross Attention for Semantic Segmentation [J].
Huang, Zilong ;
Wang, Xinggang ;
Huang, Lichao ;
Huang, Chang ;
Wei, Yunchao ;
Liu, Wenyu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :603-612
[8]   Task Agnostic Meta-Learning for Few-Shot Learning [J].
Jamal, Muhammad Abdullah ;
Qi, Guo-Jun .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11711-11719
[9]   Few-Shot Segmentation of Remote Sensing Images Using Deep Metric Learning [J].
Jiang, Xufeng ;
Zhou, Nan ;
Li, Xiang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[10]  
Jindal Swati, 2023, Proc Mach Learn Res, V210, P37