MGNet: Mutual-guidance network for few-shot semantic segmentation

被引:12
作者
Chang, Zhaobin [1 ]
Lu, Yonggang [1 ]
Wang, Xiangwen [1 ]
Ran, Xingcheng [1 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Peoples R China
关键词
Few-shot semantic segmentation; Mutual-guidance network; Prototype learning; Non-parametric metric learning; Reverse auxiliary learning;
D O I
10.1016/j.engappai.2022.105431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot semantic segmentation has recently drawn attention for its remarkable potential to segment the regions of different object classes with only a few labeled samples as guidance. Although recent methods have achieved impressive performance, there exist two critical bottleneck problems to be solved. First, most existing methods typically model a target class only using information from the foreground regions of support images, which actually does not adequately exploit the background region information of support images. Second, segmentation performance will be greatly affected when there is a large intra-class variation between the support and query images of the same class. To address these problems, we propose a mutual-guidance network (MGNet) for few-shot semantic segmentation to enhance the discriminative ability of class-specific prototypes. More specifically, the prototype learning module is first devised to learn the class-specific prototype of the foreground and background regions. Then, with non-parametric metric learning, the deep features of the query image are matched with multiple learned prototypes. Finally, to make good use of the ground truth mask of the support image, a reverse auxiliary learning module is constructed to reinforce the learned prototype. Extensive experiments on two standard benchmarks PASCAL-5(??) and COCO-20(??) are shown that the proposed method can yield competitive segmentation results with state-of-the-art methods. Surprisingly, our model achieves state-of-the-art results under both 1-shot and 5-shot tasks on more challenging COCO-20(??) when ResNet-101 is used as the backbone network.
引用
收藏
页数:13
相关论文
共 66 条
[61]   SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation [J].
Zhang, Xiaolin ;
Wei, Yunchao ;
Yang, Yi ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) :3855-3865
[62]   Pyramid Scene Parsing Network [J].
Zhao, Hengshuang ;
Shi, Jianping ;
Qi, Xiaojuan ;
Wang, Xiaogang ;
Jia, Jiaya .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239
[63]   Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers [J].
Zheng, Sixiao ;
Lu, Jiachen ;
Zhao, Hengshuang ;
Zhu, Xiatian ;
Luo, Zekun ;
Wang, Yabiao ;
Fu, Yanwei ;
Feng, Jianfeng ;
Xiang, Tao ;
Torr, Philip H. S. ;
Zhang, Li .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6877-6886
[64]   E-Res U-Net: An improved U-Net model for segmentation of muscle images [J].
Zhou, Junsheng ;
Lu, Yiwen ;
Tao, Siyi ;
Cheng, Xuan ;
Huang, Chenxi .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
[65]   SAL:Selection and Attention Losses for Weakly Supervised Semantic Segmentation [J].
Zhou, Lei ;
Gong, Chen ;
Liu, Zhi ;
Fu, Keren .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :1035-1048
[66]   Improving Semantic Segmentation via Efficient Self-Training [J].
Zhu, Yi ;
Zhang, Zhongyue ;
Wu, Chongruo ;
Zhang, Zhi ;
He, Tong ;
Zhang, Hang ;
Manmatha, R. ;
Li, Mu ;
Smola, Alexander .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) :1589-1602