Global and Local Features Through Gaussian Mixture Models on Image Semantic Segmentation

被引:3
作者
Saire, Darwin [1 ]
Rivera, Adin Ramirez [1 ,2 ]
机构
[1] Univ Estadual Campinas, Inst Comp, BR-13083852 Campinas, Brazil
[2] Univ Oslo, Dept Informat, N-0373 Oslo, Norway
基金
巴西圣保罗研究基金会;
关键词
Feature extraction; Context modeling; Task analysis; Semantics; Data mining; Decoding; Image segmentation; Explainable latent spaces; context-aware features; Gaussian mixture models; semantic segmentation;
D O I
10.1109/ACCESS.2022.3192605
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The semantic segmentation task aims at dense classification at the pixel-wise level. Deep models exhibited progress in tackling this task. However, one remaining problem with these approaches is the loss of spatial precision, often produced at the segmented objects' boundaries. Our proposed model addresses this problem by providing an internal structure for the feature representations while extracting a global representation that supports the former. To fit the internal structure, during training, we predict a Gaussian Mixture Model from the data, which, merged with the skip connections and the decoding stage, helps avoid wrong inductive biases. Furthermore, our results show that we can improve semantic segmentation by providing both learning representations (global and local) with a clustering behavior and combining them. Finally, we present results demonstrating our advances in Cityscapes and Synthia datasets.
引用
收藏
页码:77323 / 77336
页数:14
相关论文
共 78 条
[1]   Higher Order Conditional Random Fields in Deep Neural Networks [J].
Arnab, Anurag ;
Jayasumana, Sadeep ;
Zheng, Shuai ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :524-540
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[4]   Boundary Loss for Remote Sensing Imagery Semantic Segmentation [J].
Bokhovkin, Alexey ;
Burnaev, Evgeny .
ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT II, 2019, 11555 :388-401
[5]   XNet: A convolutional neural network (CNN) implementation for medical X-Ray image segmentation suitable for small datasets [J].
Bullock, Joseph ;
Cuesta-Lazaro, Carolina ;
Quera-Bofarull, Arnau .
MEDICAL IMAGING 2019: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2019, 10953
[6]  
Calinski T., 1974, Communications in Statistics, V3, P1, DOI [DOI 10.1080/03610927408827101, 10.1080/03610927408827101]
[7]   MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features [J].
Chen, Liang-Chieh ;
Hermans, Alexander ;
Papandreou, George ;
Schroff, Florian ;
Wang, Peng ;
Adam, Hartwig .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4013-4022
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709