Learning a convolutional neural network for propagation-based stereo image segmentation

Cited by: 48
Authors
Li, Xujie [1]
Huang, Hui [1]
Zhao, Hanli [1]
Wang, Yandan [1]
Hu, Mingxiao [1]
Affiliations
[1] Wenzhou Univ, Intelligent Informat Syst Inst, Wenzhou 325035, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Stereo image segmentation; Convolutional neural network; Coherent disparities; Energy minimization framework;
DOI
10.1007/s00371-018-1582-y
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Stereo image segmentation is a key technology in stereo image editing, given the growing popularity of stereoscopic 3D media. Most previous methods segment both views by relying primarily on per-pixel disparities, so the segmentation quality is closely tied to the accuracy of those disparities. A mechanism for removing disparity errors is therefore in high demand, yet to date no existing method can produce accurate disparity maps. In this paper, we propose a novel convolutional neural network (CNN)-based framework that automatically propagates the segmentation result from one view to the other. The key obstacle to accurate stereo image segmentation is that occluded regions are missing; the CNN architecture is proposed to address this problem and improve stereo segmentation performance. To cope with the inevitable inaccuracies in disparities computed from a stereo image pair, we use coherent disparity propagation, which propagates the segmentation result through those pixels with coherent disparities. The pixels obtained by coherent disparity propagation and the high-confidence pixels of the object probability map produced by the CNN are then used to generate the initial reliable pixels for a segmentation based on an energy minimization framework. Comprehensive evaluations and comparisons on the Middlebury and Adobe benchmark datasets show the effectiveness of the proposed method in terms of high-quality results and robustness to various types of input.
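The propagation step can be illustrated with a minimal sketch. The abstract does not define "coherent disparities", so the code below assumes a common left-right consistency check: a left pixel counts as coherent when the disparity stored at its matching right pixel agrees with its own within a tolerance. The function name `propagate_mask`, the parameter `tol`, and the convention that left pixel x matches right pixel x - d are all assumptions made for illustration, not the paper's implementation.

```python
import numpy as np

def propagate_mask(mask_left, disp_left, disp_right, tol=1.0):
    """Copy foreground labels from the left view to the right view,
    but only through pixels that pass a left-right disparity check.
    (Hypothetical helper; the consistency rule is an assumption.)"""
    h, w = mask_left.shape
    ys, xs = np.nonzero(mask_left)  # foreground pixels in the left view

    # Assumed convention: left pixel (y, x) matches right pixel (y, x - d).
    xr = np.rint(xs - disp_left[ys, xs]).astype(int)
    inside = (xr >= 0) & (xr < w)
    ys, xs, xr = ys[inside], xs[inside], xr[inside]

    # Coherent if both disparity maps agree at the matched pair.
    coherent = np.abs(disp_left[ys, xs] - disp_right[ys, xr]) <= tol

    mask_right = np.zeros((h, w), dtype=bool)
    mask_right[ys[coherent], xr[coherent]] = True
    return mask_right

# Toy example: a 2x2 object shifted by a disparity of 2 pixels,
# with one deliberately inconsistent right-view disparity.
mask_left = np.zeros((4, 8), dtype=bool)
mask_left[1:3, 4:6] = True
disp_left = np.full((4, 8), 2.0)
disp_right = np.full((4, 8), 2.0)
disp_right[2, 3] = 5.0  # simulated disparity error
mask_right = propagate_mask(mask_left, disp_left, disp_right)
```

In this toy case the pixel with the inconsistent disparity is left unlabelled in the right view. In the pipeline the abstract describes, such gaps would be filled by the later stages (the CNN probability map and the energy minimization), which this sketch does not model.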
Pages: 39-52
Page count: 14