GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models

被引:0
作者
Liang, Chen [1 ,3 ]
Wang, Wenguan [2 ]
Miao, Jiaxu [1 ]
Yang, Yi [1 ]
机构
[1] Zhejiang Univ, CCAI, Hangzhou, Peoples R China
[2] Univ Technol Sydney, ReLER, AAII, Sydney, NSW, Australia
[3] Baidu Res, Beijing, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
澳大利亚研究理事会; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prevalent semantic segmentation solutions are, in essence, a dense discriminative classifier of p(class |pixel feature). Though straightforward, this de facto paradigm neglects the underlying data distribution p(pixel feature |class), and struggles to identify out-of-distribution data. Going beyond this, we propose GMMSeg, a new family of segmentation models that rely on a dense generative classifier for the joint distribution p(pixel feature, class). For each class, GMMSeg builds Gaussian Mixture Models (GMMs) via Expectation-Maximization (EM), so as to capture class-conditional densities. Meanwhile, the deep dense representation is end-to-end trained in a discriminative manner, i.e., maximizing p(class |pixel feature). This endows GMMSeg with the strengths of both generative and discriminative models. With a variety of segmentation architectures and backbones, GMMSeg outperforms the discriminative counterparts on three closed-set datasets. More impressively, without any modification, GMMSeg even performs well on open-world datasets. We believe this work brings fundamental insights into the related fields.
引用
收藏
页数:16
相关论文
共 127 条
[1]  
[Anonymous], 2018, NeurIPS
[2]  
[Anonymous], 2019, INT C LEARN REPR, DOI DOI 10.1016/J.PROCI.2018.06.115
[3]  
[Anonymous], 2006, CVPR
[4]  
[Anonymous], 2019, NeurIPS
[5]  
[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01249-611
[6]  
[Anonymous], 2017, Enhancing the reliability of out-of-distribution image detection in neural networks
[7]  
[Anonymous], 2016, ICML
[8]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00770
[9]  
Ardizzone Lynton, 2020, NEURIPS
[10]  
Asano YM., 2020, INT C LEARN REPR ICL