Study on semantic image segmentation based on convolutional neural network

被引:11
作者
Li, Lin-Hui [1 ]
Qian, Bo [1 ]
Lian, Jing [1 ]
Zheng, Wei-Na [1 ]
Zhou, Ya-Fu [1 ]
机构
[1] Dalian Univ Technol, Sch Automot Engn, Fac Vehicle Engn & Mech, Dalian, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; disparity map; convolutional neural network;
D O I
10.3233/JIFS-162254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, traditional machine learning algorithms have been gradually replaced by deep learning algorithms. In the field of computer vision, convolutional neural network is considered to be the most successful deep learning model. Based on convolutional neural network, the accuracy of image classification has been greatly improved. In this paper, a method for semantic image segmentation based on convolutional neural network is proposed. Firstly, the disparity map is introduced to improve the segmentation accuracy. To obtain the disparity map with more continuous disparity values, an image smoothing method is used to optimize the disparity map. Then, based on the AlexNet network, a fully convolutional network architecture is proposed for semantic image segmentation. The unpooling operation is employed to restore the extracted features to their original sizes. The experimental results demonstrate that the network can achieve high pixel-wise prediction accuracy and that using RGB-D image as the input of the network can reduce the noisy segmentation outputs.
引用
收藏
页码:3397 / 3404
页数:8
相关论文
共 24 条
[1]  
[Anonymous], 2014, COMPUTER VISION ECCV
[2]  
[Anonymous], JILIN NORMAL U J
[3]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[4]  
[Anonymous], 2014, FULLY CONVOLUTIONAL
[5]  
[Anonymous], 2015, ABS150507293
[6]  
[Anonymous], PROC CVPR IEEE
[7]  
[Anonymous], INT C CONTR AUT SYST
[8]  
[Anonymous], 2015, Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs, DOI DOI 10.1080/17476938708814211
[9]   TEXTURE FEATURE PERFORMANCE FOR IMAGE SEGMENTATION [J].
DUBUF, JMH ;
KARDAN, M ;
SPANN, M .
PATTERN RECOGNITION, 1990, 23 (3-4) :291-309
[10]   Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture [J].
Eigen, David ;
Fergus, Rob .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2650-2658