Prototype Mixture Models for Few-Shot Semantic Segmentation

被引:294
作者
Yang, Boyu [1 ]
Liu, Chang [1 ]
Li, Bohao [1 ]
Jiao, Jianbin [1 ]
Ye, Qixiang [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT VIII | 2020年 / 12353卷
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Few-shot segmentation; Few-shot learning; Mixture models;
D O I
10.1007/978-3-030-58598-3_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot segmentation is challenging because objects within the support and query images could significantly differ in appearance and pose. Using a single prototype acquired directly from the support image to segment the query image causes semantic ambiguity. In this paper, we propose prototype mixture models (PMMs), which correlate diverse image regions with multiple prototypes to enforce the prototype-based semantic representation. Estimated by an Expectation-Maximization algorithm, PMMs incorporate rich channel-wised and spatial semantics from limited support images. Utilized as representations as well as classifiers, PMMs fully leverage the semantics to activate objects in the query image while depressing background regions in a duplex manner. Extensive experiments on Pascal VOC and MS-COCO datasets show that PMMs significantly improve upon state-of-the-arts. Particularly, PMMs improve 5-shot segmentation performance on MS-COCO by up to 5.82% with only a moderate cost for model size and inference speed (Code is available at github.com/Yang-Bob/PMMs.).
引用
收藏
页码:763 / 778
页数:16
相关论文
共 34 条
[1]  
Banerjee A, 2005, J MACH LEARN RES, V6, P1345
[2]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[5]  
Chen W.Y., 2019, IEEE ICLR
[6]  
Dong N., 2018, BMVC, V3
[7]  
Finn C, 2017, PR MACH LEARN RES, V70
[8]   Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning [J].
Hao, Fusheng ;
He, Fengxiang ;
Cheng, Jun ;
Wang, Lei ;
Cao, Jianzhong ;
Tao, Dacheng .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8459-8468
[9]   Low-shot Visual Recognition by Shrinking and Hallucinating Features [J].
Hariharan, Bharath ;
Girshick, Ross .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3037-3046
[10]  
Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343