Deep Reasoning Network for Few-shot Semantic Segmentation

被引:9
作者
Zhuge, Yunzhi [1 ]
Shen, Chunhua [1 ]
机构
[1] Univ Adelaide, Adelaide, SA, Australia
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
关键词
Few-shot semantic segmentation; dynamic convolutional network; feature integration;
D O I
10.1145/3474085.3475658
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Semantic Segmentation (FSS) is a challenging problem in computer vision, which aims at segmenting objects of interest with only one or several annotated samples per category for training. The essence of FSS is to disseminate information from support images to query images for segmenting the mutual object categories. In this paper, we propose a Dynamic Reasoning Network (DRNet) to adaptively generate the parameters of predicting layers and infer the segmentation mask for each unseen category. More specifically, an Attentional Feature Integration Sub-network (AFIS) is first proposed to extract consistent features from support images and query images. With shared weights, it stimulates the category consistency of different data streams. Then, a Pooling-based Guidance Module (PGM) is used to correlate support features with query features progressively. To disseminate information from support images to various query images, we further propose a Dynamic Prediction Module (DPM) for generating the parameters of predicting layers. The proposed modules are unified for the dynamic reasoning of each query image segmentation. Experiments on two public benchmarks have demonstrated that our approach achieves superior performance and outperforms the recent state-of-the-art methods.
引用
收藏
页码:5344 / 5352
页数:9
相关论文
共 50 条
[1]  
[Anonymous], 2018, P BRIT MACH VIS C
[2]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[3]   BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation [J].
Dai, Jifeng ;
He, Kaiming ;
Sun, Jian .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1635-1643
[4]  
De Brabandere B, 2016, ADV NEUR IN, V29
[5]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[6]  
Garcia Victor., 2017, ARXIV171104043
[7]  
Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343
[8]   Adaptive Pyramid Context Network for Semantic Segmentation [J].
He, Junjun ;
Deng, Zhongying ;
Zhou, Lei ;
Wang, Yali ;
Qiao, Yu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7511-7520
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Hung W.-C., 2018, P BRIT MACH VIS C