Learning self-target knowledge for few-shot segmentation

被引：6

作者：

Chen, Yadang ^{[1
,2
]}

Chen, Sihan ^{[1
,2
]}

Yang, Zhi-Xin ^{[3
]}

Wu, Enhua ^{[4
]}

机构：

[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China

[3] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China

[4] Univ Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 149卷

基金：

中国国家自然科学基金;

关键词：

Few-shot segmentation; Two-level similarity matching; Step-by-step mining; Attention mechanism;

D O I：

10.1016/j.patcog.2024.110266

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot semantic segmentation uses a few annotated data of a specific class in the support set to segment the target of the same class in the query set. Most existing approaches fail to perform well when there are significant intra-class variances. This paper alleviates the problem by concentrating on mining the query image and using the support set as supplementary information. First, it proposes a Query Prototype Generation Module to generate a query foreground prototype from the query features. Specifically, we use both prototypelevel and pixel-level similarity matching to generate two complementary initial prototypes, which we then integrate to create a discriminative query foreground prototype. Second, we propose a Support Auxiliary Refinement Module to further guide the final precise prediction of the query image by leveraging the target category information of the support set through step -by-step mining. Specifically, we generate a query-support mixture prototype based on the support prototype representation obtained using the attention mechanism. Then we generate a support supplement prototype to complement the missing information by encoding over the foreground regions that the query-support mixture prototype fails to segment out. Extensive experiments on PASCAL-5 ' and COCO-20(iota). demonstrate that our model outperforms the prior works of few-shot segmentation.

引用

页数：14

共 42 条

[1]

Chen J, 2021, arXiv

[2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].