Unsupervised Video Object Segmentation with Motion-Based Bilateral Networks

被引:102
作者
Li, Siyang [1 ,2 ]
Seybold, Bryan [2 ]
Vorobyov, Alexey [2 ]
Lei, Xuejing [1 ]
Kuo, C-C Jay [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Google AI Percept, Mountain View, CA 94043 USA
来源
COMPUTER VISION - ECCV 2018, PT III | 2018年 / 11207卷
关键词
Video object segmentation; Bilateral networks; Instance embeddings;
D O I
10.1007/978-3-030-01219-9_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the unsupervised video object segmentation problem where moving objects are segmented without prior knowledge of these objects. First, we propose a motion-based bilateral network to estimate the background based on the motion pattern of non-object regions. The bilateral network reduces false positive regions by accurately identifying background objects. Then, we integrate the background estimate from the bilateral network with instance embeddings into a graph, which allows multiple frame reasoning with graph edges linking pixels from different frames. We classify graph nodes by defining and minimizing a cost function, and segment the video frames based on the node labels. The proposed method outperforms previous state-of-the-art unsupervised video object segmentation methods against the DAVIS 2016 and the FBMS-59 datasets.
引用
收藏
页码:215 / 231
页数:17
相关论文
共 38 条
  • [1] Fast High-Dimensional Filtering Using the Permutohedral Lattice
    Adams, Andrew
    Baek, Jongmin
    Davis, Myers Abraham
    [J]. COMPUTER GRAPHICS FORUM, 2010, 29 (02) : 753 - 762
  • [2] [Anonymous], 2017, COMPUTER VISION PATT
  • [3] [Anonymous], 2011, ADV NEURAL INF PROCE
  • [4] [Anonymous], 2017, BRIT MACH VIS C
  • [5] [Anonymous], 2017, Semantic instance segmentation via deep metric learning
  • [6] [Anonymous], 2010, INT J COMPUT VISION, DOI DOI 10.1007/s11263-009-0275-4
  • [7] [Anonymous], 2017, ARXIV170306870
  • [8] Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation
    Brox, Thomas
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) : 500 - 513
  • [9] One-Shot Video Object Segmentation
    Caelles, S.
    Maninis, K. -K.
    Pont-Tuset, J.
    Leal-Taixe, L.
    Cremers, D.
    Van Gool, L.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5320 - 5329
  • [10] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848