Class-Aware Self- and Cross-Attention Network for Few-Shot Semantic Segmentation of Remote Sensing Images

被引：1

作者：

Liang, Guozhen ^{[1
]}

Xie, Fengxi ^{[1
]}

Chien, Ying-Ren ^{[2
]}

机构：

[1] Tech Univ Berlin, Dept Elect Engn & Comp Sci, D-10623 Berlin, Germany

[2] Natl Ilan Univ, Dept Elect Engn, Yilan 260007, Taiwan

来源：

MATHEMATICS | 2024年 / 12卷 / 17期

关键词：

few-shot learning; few-shot semantic segmentation; remote sensing; class-aware self- and cross-attention;

D O I：

10.3390/math12172761

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Few-Shot Semantic Segmentation (FSS) has drawn massive attention recently due to its remarkable ability to segment novel-class objects given only a handful of support samples. However, current FSS methods mainly focus on natural images and pay little attention to more practical and challenging scenarios, e.g., remote sensing image segmentation. In the field of remote sensing image analysis, the characteristics of remote sensing images, like complex backgrounds and tiny foreground objects, make novel-class segmentation challenging. To cope with these obstacles, we propose a Class-Aware Self- and Cross-Attention Network (CSCANet) for FSS in remote sensing imagery, consisting of a lightweight self-attention module and a supervised prior-guided cross-attention module. Concretely, the self-attention module abstracts robust unseen-class information from support features, while the cross-attention module generates a superior quality query attention map for directing the network to focus on novel objects. Experiments demonstrate that our CSCANet achieves outstanding performance on the standard remote sensing FSS benchmark iSAID-5i, surpassing the existing state-of-the-art FSS models across all combinations of backbone networks and K-shot settings.

引用

页数：14

共 45 条

[1]

Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45

[2]

Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379

[3]

Cheng Ouyang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12374), P762, DOI 10.1007/978-3-030-58526-6_45

[4]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[5]

Finn C, 2017, PR MACH LEARN RES, V70

[6]

Haochen Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12358), P730, DOI 10.1007/978-3-030-58601-0_43

[7] CCNet: Criss-Cross Attention for Semantic Segmentation [J].

Huang, Zilong ;

Wang, Xinggang ;

Huang, Lichao ;

Huang, Chang ;

Wei, Yunchao ;

Liu, Wenyu .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :603-612

[8] Task Agnostic Meta-Learning for Few-Shot Learning [J].

Jamal, Muhammad Abdullah ;

Qi, Guo-Jun .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11711-11719

[9] Few-Shot Segmentation of Remote Sensing Images Using Deep Metric Learning [J].

Jiang, Xufeng ;

Zhou, Nan ;

Li, Xiang .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

[10]

Jindal Swati, 2023, Proc Mach Learn Res, V210, P37

← 1 2 3 4 5 →