Indoor scene segmentation algorithm based on full convolutional neural network

被引:11
作者
Zhu, Zijiang [1 ,2 ]
Li, Deming [3 ]
Hu, Yi [1 ]
Li, Junshan [1 ,2 ]
Liu, Dong [1 ]
Li, Jianjun [1 ]
机构
[1] Univ Foreign Studies, Sch Informat Sci & Technol, South China Business Coll Guangdong, Guangzhou 510545, Guangdong, Peoples R China
[2] Univ Foreign Studies, Inst Intelligent Informat Proc, South China Business Coll Guangdong, Guangzhou 510545, Guangdong, Peoples R China
[3] Guangxi Normal Univ, Coll Phys Sci & Technol, Guilin 541004, Guangxi, Peoples R China
关键词
Indoor scene; Convolutional neural network; Deep learning; Image segmentation; IMAGE SEGMENTATION; CLASSIFICATION;
D O I
10.1007/s00521-020-04961-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the leaps and bounds of computer performance and the advent of the era of big data, deep learning has drawn more and more attention from all walks of life. It can combine low-level features to form more abstract high-level features and describe the data more essentially. Therefore, it is widely used in various fields such as computer vision. Image segmentation is one of the most basic research topics in the field of computer vision. The main purpose is to extract regions of interest from images for later image processing research. In 3D reconstruction based on sequence images, the segmentation accuracy and speed of sequence images determine the quality and efficiency of target reconstruction. Therefore, when facing large-scale sequence images, the biggest problem is how to improve the segmentation speed while ensuring accuracy. Based on the above background, the research content of this article is an indoor scene segmentation algorithm based on full convolutional neural network. According to the characteristics of indoor application scenes, this paper proposes a fast convolutional neural network image segmentation method to segment the indoor scene image and construct the fast fully convolutional networks (FFCN) for indoor scene image segmentation uses inter-layer fusion to reduce the amount of network calculation parameters and avoid the loss of picture feature information by continuous convolution. In order to verify the effectiveness of the network, in this paper, a basic living object data set (XAUT data set) in an indoor environment is created. The XAUT data set is used to train the FFCN network under the Caffe framework to obtain an indoor scene segmentation model. In order to compare the effectiveness of the model, the structure of the worn FCN8s, FCN16s, and FCN32s models was fine-tuned, and the corresponding algorithm model for indoor scene segmentation was obtained by training with the XAUT data set. The experimental results show that the pixel recognition accuracy of all types of networks has reached 86%, and the mean IU ratio has reached more than 63%. The mean IU of the FCN8s network is the highest at 70.38%, but its segmentation speed is only 1/5 of FFCN. On the premise that other types of indicators are not much different, the average segmentation speed on FFCN fast segmentation convolutional neural network reaches 40 fps. It can be seen that the scale fusion technology can well avoid the loss of image feature information in the network convolution and reddening process. Compared with other FCN networks, it has a faster speed and is conducive to real-time image preprocessing.
引用
收藏
页码:8261 / 8273
页数:13
相关论文
共 25 条
[1]  
[Anonymous], 2015, ACTA ECOL SIN
[2]   Learning visual similarity for product design with convolutional neural networks [J].
Bell, Sean ;
Bala, Kavita .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04)
[3]   First Steps Toward Camera Model Identification With Convolutional Neural Networks [J].
Bondi, Luca ;
Baroffio, Luca ;
Gueera, David ;
Bestagini, Paolo ;
Delp, Edward J. ;
Tubaro, Stefano .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) :259-263
[4]  
[曹家梓 Cao Jiazi], 2015, [仪器仪表学报, Chinese Journal of Scientific Instrument], V36, P776
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]  
Deng C, 2015, COMMUN COMPUT INF SC, V482, P179
[7]   Sub-Markov Random Walk for Image Segmentation [J].
Dong, Xingping ;
Shen, Jianbing ;
Shao, Ling ;
Van Gool, Luc .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) :516-527
[8]   Vehicle Type Classification Using a Semisupervised Convolutional Neural Network [J].
Dong, Zhen ;
Wu, Yuwei ;
Pei, Mingtao ;
Jia, Yunde .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) :2247-2256
[9]   Residual Deconvolutional Networks for Brain Electron Microscopy Image Segmentation [J].
Fakhry, Ahmed ;
Zeng, Tao ;
Ji, Shuiwang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (02) :447-456
[10]   HfO2-Based OxRAM Devices as Synapses for Convolutional Neural Networks [J].
Garbin, Daniele ;
Vianello, Elisa ;
Bichler, Olivier ;
Rafhay, Quentin ;
Gamrat, Christian ;
Ghibaudo, Gerard ;
DeSalvo, Barbara ;
Perniola, Luca .
IEEE TRANSACTIONS ON ELECTRON DEVICES, 2015, 62 (08) :2494-2501