CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning

被引:452
作者
Zhang, Chi [1 ]
Lin, Guosheng [1 ]
Liu, Fayao [2 ]
Yao, Rui [3 ]
Shen, Chunhua [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Univ Adelaide, Adelaide, SA, Australia
[3] China Univ Min & Technol, Beijing, Peoples R China
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR.2019.00536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets. However, data labeling for pixel-wise segmentation is tedious and costly. Moreover, a trained model can only make predictions within a set of pre-defined classes. In this paper, we present CANet, a class-agnostic segmentation network that performs few-shot segmentation on new classes with only a few annotated images available. Our network consists of a two-branch dense comparison module which performs multi-level feature comparison between the support image and the query image, and an iterative optimization module which iteratively refines the predicted results. Furthermore, we introduce an attention mechanism to effectively fuse information from multiple support examples under the setting of k-shot learning. Experiments on PASCAL VOC 2012 show that our method achieves a mean Intersection-over-Union score of 55.4% for 1-shot segmentation and 57.1% for 5-shot segmentation, outperforming state-of-the-art methods by a large margin of 14.6% and 13.2%, respectively.
引用
收藏
页码:5212 / 5221
页数:10
相关论文
共 40 条
[11]   Simultaneous Detection and Segmentation [J].
Hariharan, Bharath ;
Arbelaez, Pablo ;
Girshick, Ross ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :297-312
[12]  
He K., 2016, CVPR, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
[13]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[14]  
Ke MX, 2017, PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), P616, DOI 10.1109/CompComm.2017.8322618
[15]  
Koch G, 2015, ICML DEEP LEARN WORK, V2
[16]  
Krahenbuhl P., 2011, Adv. Neural Inf. Process. Syst., V24
[17]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[18]   ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation [J].
Lin, Di ;
Dai, Jifeng ;
Jia, Jiaya ;
He, Kaiming ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3159-3167
[19]   RefineNet: Multi-Path Refinement Networks for Dense Prediction [J].
Lin, Guosheng ;
Liu, Fayao ;
Milan, Anton ;
Shen, Chunhua ;
Reid, Ian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) :1228-1242
[20]   RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation [J].
Lin, Guosheng ;
Milan, Anton ;
Shen, Chunhua ;
Reid, Ian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5168-5177