An enhancement model based on dense atrous and inception convolution for image semantic segmentation

被引:10
作者
Zhou, Erjing [1 ]
Xu, Xiang [2 ]
Xu, Baomin [1 ]
Wu, Hongwei [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[2] Simon Fraser Univ, Sch Comp Sci, Vancouver, BC, Canada
关键词
Image semantic segmentation; Dense convolution; Atrous convolution; Inception convolution; Full convolution neural network;
D O I
10.1007/s10489-022-03448-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of semantic segmentation is to classify each pixel in the image, so as to segment out the specific contour of the target. Most previous semantic segmentation models cannot generate enough semantic information for each pixel to understand the content of complex scenes. In this paper, we propose a novel semantic segmentation model Ince-DResAsppNet based on dense convoluted separation convolution. Unlike the previous model, our model revolves around reducing semantic information loss and enhancing detailed information. In the feature extraction part of the model, the idea of Dense and Ince is introduced to expand the number of channels on the basis of feature reuse. In the feature fusion part, Dense and Atrous's idea of dense dilated based on coprime factors is introduced, combined with multi-scale feature information to expand the receptive field and collect more dense pixels. Experiments conducted on the dataset PASCAL VOC 2012 and the CityScapes dataset show that our method performs better than the existing semantic segmentation model. Our model achieves 83.3% and 78.1% segmentation accuracy on the mIoU indicator, which surpasses many classical semantic segmentation models.
引用
收藏
页码:5519 / 5531
页数:13
相关论文
共 42 条
[1]  
[Anonymous], 2016, ICLR
[2]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.348
[3]  
[Anonymous], 2016, ARXIV160504068
[4]  
[Anonymous], 2016, ARXIV160702537
[5]  
[Anonymous], 2015, ARXIV151108119
[6]  
Badrinarayanan V., 2015, ARXIV
[7]   Fast, Exact and Multi-scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs [J].
Chandra, Siddhartha ;
Kokkinos, Iasonas .
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :402-418
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]  
Chen LW, 2016, IEEE CONF COMPUT