Unsupervised Video Object Segmentation with Motion-Based Bilateral Networks

被引:112
作者
Li, Siyang [1 ,2 ]
Seybold, Bryan [2 ]
Vorobyov, Alexey [2 ]
Lei, Xuejing [1 ]
Kuo, C-C Jay [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Google AI Percept, Mountain View, CA 94043 USA
来源
COMPUTER VISION - ECCV 2018, PT III | 2018年 / 11207卷
关键词
Video object segmentation; Bilateral networks; Instance embeddings;
D O I
10.1007/978-3-030-01219-9_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the unsupervised video object segmentation problem where moving objects are segmented without prior knowledge of these objects. First, we propose a motion-based bilateral network to estimate the background based on the motion pattern of non-object regions. The bilateral network reduces false positive regions by accurately identifying background objects. Then, we integrate the background estimate from the bilateral network with instance embeddings into a graph, which allows multiple frame reasoning with graph edges linking pixels from different frames. We classify graph nodes by defining and minimizing a cost function, and segment the video frames based on the node labels. The proposed method outperforms previous state-of-the-art unsupervised video object segmentation methods against the DAVIS 2016 and the FBMS-59 datasets.
引用
收藏
页码:215 / 231
页数:17
相关论文
共 38 条
[1]   Fast High-Dimensional Filtering Using the Permutohedral Lattice [J].
Adams, Andrew ;
Baek, Jongmin ;
Davis, Myers Abraham .
COMPUTER GRAPHICS FORUM, 2010, 29 (02) :753-762
[2]  
[Anonymous], 2017, COMPUTER VISION PATT
[3]  
[Anonymous], 2011, ADV NEURAL INF PROCE
[4]  
[Anonymous], 2017, BRIT MACH VIS C
[5]  
[Anonymous], 2017, Semantic instance segmentation via deep metric learning
[6]  
[Anonymous], 2010, INT J COMPUT VISION, DOI DOI 10.1007/s11263-009-0275-4
[7]  
[Anonymous], 2017, ARXIV170306870
[8]   Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation [J].
Brox, Thomas ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :500-513
[9]   One-Shot Video Object Segmentation [J].
Caelles, S. ;
Maninis, K. -K. ;
Pont-Tuset, J. ;
Leal-Taixe, L. ;
Cremers, D. ;
Van Gool, L. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5320-5329
[10]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848