Deep Reasoning Network for Few-shot Semantic Segmentation

被引：9

作者：

Zhuge, Yunzhi ^{[1
]}

Shen, Chunhua ^{[1
]}

机构：

[1] Univ Adelaide, Adelaide, SA, Australia

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

关键词：

Few-shot semantic segmentation; dynamic convolutional network; feature integration;

D O I：

10.1145/3474085.3475658

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot Semantic Segmentation (FSS) is a challenging problem in computer vision, which aims at segmenting objects of interest with only one or several annotated samples per category for training. The essence of FSS is to disseminate information from support images to query images for segmenting the mutual object categories. In this paper, we propose a Dynamic Reasoning Network (DRNet) to adaptively generate the parameters of predicting layers and infer the segmentation mask for each unseen category. More specifically, an Attentional Feature Integration Sub-network (AFIS) is first proposed to extract consistent features from support images and query images. With shared weights, it stimulates the category consistency of different data streams. Then, a Pooling-based Guidance Module (PGM) is used to correlate support features with query features progressively. To disseminate information from support images to various query images, we further propose a Dynamic Prediction Module (DPM) for generating the parameters of predicting layers. The proposed modules are unified for the dynamic reasoning of each query image segmentation. Experiments on two public benchmarks have demonstrated that our approach achieves superior performance and outperforms the recent state-of-the-art methods.

引用

页码：5344 / 5352

页数：9

共 50 条

[1]

[Anonymous], 2018, P BRIT MACH VIS C

[2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[3] BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation [J].

Dai, Jifeng ;

He, Kaiming ;

Sun, Jian .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1635-1643

[4]

De Brabandere B, 2016, ADV NEUR IN, V29

[5] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[6]

Garcia Victor., 2017, ARXIV171104043

[7]

Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343

[8] Adaptive Pyramid Context Network for Semantic Segmentation [J].

He, Junjun ;

Deng, Zhongying ;

Zhou, Lei ;

Wang, Yali ;

Qiao, Yu .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7511-7520

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10]

Hung W.-C., 2018, P BRIT MACH VIS C

← 1 2 3 4 5 →