MGNet: Mutual-guidance network for few-shot semantic segmentation

被引:12
作者
Chang, Zhaobin [1 ]
Lu, Yonggang [1 ]
Wang, Xiangwen [1 ]
Ran, Xingcheng [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Peoples R China
关键词
Few-shot semantic segmentation; Mutual-guidance network; Prototype learning; Non-parametric metric learning; Reverse auxiliary learning;
D O I
10.1016/j.engappai.2022.105431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot semantic segmentation has recently drawn attention for its remarkable potential to segment the regions of different object classes with only a few labeled samples as guidance. Although recent methods have achieved impressive performance, there exist two critical bottleneck problems to be solved. First, most existing methods typically model a target class only using information from the foreground regions of support images, which actually does not adequately exploit the background region information of support images. Second, segmentation performance will be greatly affected when there is a large intra-class variation between the support and query images of the same class. To address these problems, we propose a mutual-guidance network (MGNet) for few-shot semantic segmentation to enhance the discriminative ability of class-specific prototypes. More specifically, the prototype learning module is first devised to learn the class-specific prototype of the foreground and background regions. Then, with non-parametric metric learning, the deep features of the query image are matched with multiple learned prototypes. Finally, to make good use of the ground truth mask of the support image, a reverse auxiliary learning module is constructed to reinforce the learned prototype. Extensive experiments on two standard benchmarks PASCAL-5(??) and COCO-20(??) are shown that the proposed method can yield competitive segmentation results with state-of-the-art methods. Surprisingly, our model achieves state-of-the-art results under both 1-shot and 5-shot tasks on more challenging COCO-20(??) when ResNet-101 is used as the backbone network.
引用
收藏
页数:13
相关论文
共 66 条
[1]   MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation [J].
Amac, Mustafa Sercan ;
Sencan, Ahmet ;
Baran, Orhun Bugra ;
Ikizler-Cinbis, Nazli ;
Cinbis, Ramazan Gokberk .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :428-438
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need? [J].
Boudiaf, Malik ;
Kervadec, Hoel ;
Masud, Ziko Imtiaz ;
Piantanida, Pablo ;
Ben Ayed, Ismail ;
Dolz, Jose .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13974-13983
[4]  
Chen H., 2021, P IEEECVF C COMPUTER, p14 514
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]  
Dong Nanqing, 2018, BMVC
[7]  
Dosovitskiy A., 2021, P 9 INT C LEARN REPR
[8]   DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes [J].
Elhassan, Mohammed A. M. ;
Huang, Chenxi ;
Yang, Chenhui ;
Munea, Tewodros Legesse .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
[9]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[10]   Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector [J].
Fan, Qi ;
Zhuo, Wei ;
Tang, Chi-Keung ;
Tai, Yu-Wing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4012-4021