Cycle association prototype network for few-shot semantic segmentation

被引:1
作者
Hao, Zhuangzhuang [1 ,2 ,3 ,4 ]
Shao, Ji [1 ,2 ]
Gong, Bo [1 ,3 ]
Yang, Jingwen [5 ]
Jing, Ling [1 ,5 ]
Chen, Yingyi [1 ,2 ,3 ,4 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Smart Farming Technol Aquat Anim & Livesto, Beijing 100083, Peoples R China
[4] Beijing Engn & Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[5] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Semantic segmentation; Few-shot learning; Cycle-consistency learning;
D O I
10.1016/j.engappai.2024.109309
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot segmentation aims to train a segmentation model that can quickly adapt to novel classes referring to only a few annotated samples. Existing few-shot segmentation methods are based on the meta-learning strategy and extract support samples' information from a support set and then apply the information to make predictions on query images. However, most methods abstract support features into prototype vectors and ignore the crucial relationship between query and support samples. To address the problem, we propose a cycle association prototype network that focuses on pixel-wise relationships between support and query images for more accurate segmentation. Specifically, a cycle-consistent prototype module is proposed to select reliable support features and to generate prototype. To capture cross-scale relations and overcome object variations, we introduce a scale-aware prior mask generation module to offer rich guidance for objects of varying sizes and shapes via calculating the pixel-level similarity between the support and query image features. Finally, a mask generation module, which contains two parallel modules, feature fusion module and transformer decoder, is utilized to predict the query image. Extensive experiments on two datasets show that our method yields superior performance with state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 55 条
[1]  
Cao L., 2022, arXiv
[2]   DRNet: Disentanglement and Recombination Network for Few-Shot Semantic Segmentation [J].
Chang, Zhaobin ;
Gao, Xiong ;
Li, Na ;
Zhou, Huiyu ;
Lu, Yonggang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) :5560-5574
[3]   Pixel Matching Network for Cross-Domain Few-Shot Segmentation [J].
Chen, Hao ;
Dong, Yonghan ;
Lu, Zheming ;
Yu, Yunlong ;
Han, Jungong .
2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, :967-976
[4]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[5]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation [J].
Chen, Yadang ;
Jiang, Ren ;
Zheng, Yuhui ;
Sheng, Bin ;
Yang, Zhi-Xin ;
Wu, Enhua .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1432-1447
[8]   Temporal Cycle-Consistency Learning [J].
Dwibedi, Debidatta ;
Aytar, Yusuf ;
Tompson, Jonathan ;
Sermanet, Pierre ;
Zisserman, Andrew .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1801-1810
[9]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[10]   Self-support Few-Shot Semantic Segmentation [J].
Fan, Qi ;
Pei, Wenjie ;
Tai, Yu-Wing ;
Tang, Chi-Keung .
COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :701-719