Learning self-target knowledge for few-shot segmentation

被引:6
作者
Chen, Yadang [1 ,2 ]
Chen, Sihan [1 ,2 ]
Yang, Zhi-Xin [3 ]
Wu, Enhua [4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[3] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China
[4] Univ Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot segmentation; Two-level similarity matching; Step-by-step mining; Attention mechanism;
D O I
10.1016/j.patcog.2024.110266
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot semantic segmentation uses a few annotated data of a specific class in the support set to segment the target of the same class in the query set. Most existing approaches fail to perform well when there are significant intra-class variances. This paper alleviates the problem by concentrating on mining the query image and using the support set as supplementary information. First, it proposes a Query Prototype Generation Module to generate a query foreground prototype from the query features. Specifically, we use both prototypelevel and pixel-level similarity matching to generate two complementary initial prototypes, which we then integrate to create a discriminative query foreground prototype. Second, we propose a Support Auxiliary Refinement Module to further guide the final precise prediction of the query image by leveraging the target category information of the support set through step -by-step mining. Specifically, we generate a query-support mixture prototype based on the support prototype representation obtained using the attention mechanism. Then we generate a support supplement prototype to complement the missing information by encoding over the foreground regions that the query-support mixture prototype fails to segment out. Extensive experiments on PASCAL-5 ' and COCO-20(iota). demonstrate that our model outperforms the prior works of few-shot segmentation.
引用
收藏
页数:14
相关论文
共 42 条
[1]  
Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45
[2]  
Chen Jiacheng, 2021, arXiv
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]   Fast target-aware learning for few-shot video object segmentation [J].
Chen, Yadang ;
Hao, Chuanyan ;
Yang, Zhi-Xin ;
Wu, Enhua .
SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (08)
[5]   Meta-Learning-Based Incremental Few-Shot Object Detection [J].
Cheng, Meng ;
Wang, Hanli ;
Long, Yu .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) :2158-2169
[6]   Self-regularized prototypical network for few-shot semantic segmentation [J].
Ding, Henghui ;
Zhang, Hui ;
Jiang, Xudong .
PATTERN RECOGNITION, 2023, 133
[7]  
Dong N., 2018, BMVC, V3, P79
[8]   Learning Robust Discriminant Subspace Based on Joint L2, p- and L2,s-Norm Distance Metrics [J].
Fu, Liyong ;
Li, Zechao ;
Ye, Qiaolin ;
Yin, Hang ;
Liu, Qingwang ;
Chen, Xiaobo ;
Fan, Xijian ;
Yang, Wankou ;
Yang, Guowei .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) :130-144
[9]   Incremental Few-Shot Instance Segmentation [J].
Ganea, Dan Andrei ;
Boom, Bas ;
Poppe, Ronald .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1185-1194
[10]  
Haochen Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12358), P730, DOI 10.1007/978-3-030-58601-0_43