Joint Stereo Video Deblurring, Scene Flow Estimation and Moving Object Segmentation

被引:27
作者
Pan, Liyuan [1 ]
Dai, Yuchao [2 ]
Liu, Miaomiao [1 ]
Porikli, Fatih [1 ]
Pan, Quan [3 ]
机构
[1] Australian Natl Univ, Res Sch Engn, Canberra, ACT 2601, Australia
[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Sch Automat, Xian 710129, Shaanxi, Peoples R China
基金
澳大利亚研究理事会;
关键词
Estimation; Cameras; Motion segmentation; Object segmentation; Image segmentation; Kernel; Semantics; Stereo deblurring; motion blur; scene flow; moving object segmentation; joint optimization; CAMERA SHAKE; MOTION; IMAGE;
D O I
10.1109/TIP.2019.2945867
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stereo videos for the dynamic scenes often show unpleasant blurred effects due to the camera motion and the multiple moving objects with large depth variations. Given consecutive blurred stereo video frames, we aim to recover the latent clean images, estimate the 3D scene flow and segment the multiple moving objects. These three tasks have been previously addressed separately, which fail to exploit the internal connections among these tasks and cannot achieve optimality. In this paper, we propose to jointly solve these three tasks in a unified framework by exploiting their intrinsic connections. To this end, we represent the dynamic scenes with the piece-wise planar model, which exploits the local structure of the scene and expresses various dynamic scenes. Under our model, these three tasks are naturally connected and expressed as the parameter estimation of 3D scene structure and camera motion (structure and motion for the dynamic scenes). By exploiting the blur model constraint, the moving objects and the 3D scene structure, we reach an energy minimization formulation for joint deblurring, scene flow and segmentation. We evaluate our approach extensively on both synthetic datasets and publicly available real datasets with fast-moving objects, camera motion, uncontrolled lighting conditions and shadows. Experimental results demonstrate that our method can achieve significant improvement in stereo video deblurring, scene flow estimation and moving object segmentation, over state-of-the-art methods.
引用
收藏
页码:1748 / 1761
页数:14
相关论文
共 77 条
[1]  
Ben-Ezra M, 2004, IEEE T PATTERN ANAL, V26, P689, DOI 10.1109/TPAMI.2004.1
[2]   A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging [J].
Chambolle, Antonin ;
Pock, Thomas .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2011, 40 (01) :120-145
[3]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[4]  
Dai SY, 2008, PROC CVPR IEEE, P1865
[5]   FlowNet: Learning Optical Flow with Convolutional Networks [J].
Dosovitskiy, Alexey ;
Fischer, Philipp ;
Ilg, Eddy ;
Haeusser, Philip ;
Hazirbas, Caner ;
Golkov, Vladimir ;
van der Smagt, Patrick ;
Cremers, Daniel ;
Brox, Thomas .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2758-2766
[6]  
Faktor A., 2014, P BRIT MACH VIS C, V2, P8
[7]   JumpCut: Non-Successive Mask Transfer and Interpolation for Video Cutout [J].
Fan, Qingnan ;
Zhong, Fan ;
Lischinski, Dani ;
Cohen-Or, Daniel ;
Chen, Baoquan .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06)
[8]   Removing camera shake from a single photograph [J].
Fergus, Rob ;
Singh, Barun ;
Hertzmann, Aaron ;
Roweis, Sam T. ;
Freeman, William T. .
ACM TRANSACTIONS ON GRAPHICS, 2006, 25 (03) :787-794
[9]   Real-time stereo vision for urban traffic scene understanding [J].
Franke, U ;
Joos, A .
PROCEEDINGS OF THE IEEE INTELLIGENT VEHICLES SYMPOSIUM 2000, 2000, :273-278
[10]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237