Not Just Learning From Others but Relying on Yourself: A New Perspective on Few-Shot Segmentation in Remote Sensing

被引:16
作者
Bi, Hanbo [1 ,2 ,3 ]
Feng, Yingchao [1 ,3 ]
Yan, Zhiyuan [1 ,3 ]
Mao, Yongqiang [1 ,2 ,3 ]
Diao, Wenhui [1 ,3 ]
Wang, Hongqi [1 ,3 ]
Sun, Xian [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Few-shot learning; few-shot segmentation (FSS); prototype learning; remote sensing; semantic segmentation;
D O I
10.1109/TGRS.2023.3326292
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot segmentation (FSS) is proposed to segment unknown class targets with just a few annotated samples. Most current FSS methods follow the paradigm of mining the semantics from the support images to guide the query image segmentation. However, such a pattern of "learning from others" struggles to handle the extreme intraclass variation, preventing FSS from being directly generalized to remote sensing scenes. To bridge the gap of intraclass variance, we develop a dual-mining network named DMNet for cross-image mining and self-mining, meaning that it no longer focuses solely on support images but pays more attention to the query image itself. Specifically, we propose a class-public region mining (CPRM) module to effectively suppress irrelevant feature pollution by capturing the common semantics between the support-query image pair. The class-specific region mining (CSRM) module is then proposed to continuously mine the class-specific semantics of the query image itself in a "filtering" and "purifying" manner. In addition, to prevent the coexistence of multiple classes in remote sensing scenes from exacerbating the collapse of FSS generalization, we also propose a new known-class metasuppressor (KMS) module to suppress the activation of known-class objects in the sample. Extensive experiments on the iSAID and LoveDA remote sensing datasets have demonstrated that our method sets the state of the art with a minimum number of model parameters. Significantly, our model with the backbone of Resnet-50 achieves the mean Intersection over Union (mIoU) of 49.58% and 51.34% on iSAID under 1- and 5-shot settings, outperforming the state-of-the-art method by 1.8% and 1.12%, respectively. The code is publicly available at https://github.com/HanboBizl/DMNet/.
引用
收藏
页数:21
相关论文
共 75 条
[1]  
[Anonymous], 2018, Urban remote sensing
[2]  
Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Chen WY, 2020, Arxiv, DOI arXiv:1904.04232
[5]   Semi-Supervised Contrastive Learning for Few-Shot Segmentation of Remote Sensing Images [J].
Chen, Yadang ;
Wei, Chenchen ;
Wang, Duolin ;
Ji, Chuanjun ;
Li, Baozhu .
REMOTE SENSING, 2022, 14 (17)
[6]   Holistic Prototype Activation for Few-Shot Segmentation [J].
Cheng, Gong ;
Lang, Chunbo ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :4650-4666
[7]   SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification [J].
Cheng, Gong ;
Cai, Liming ;
Lang, Chunbo ;
Yao, Xiwen ;
Chen, Jinyong ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]  
Dey V, 2010, INT ARCH PHOTOGRAMM, V38, P31
[10]  
Dong N., 2018, BRIT MACHINE VISION, P79