Unsupervised Video Object Segmentation with Motion-Based Bilateral Networks

Cited by: 112

Authors:

Li, Siyang [1,2]

Seybold, Bryan [2]

Vorobyov, Alexey [2]

Lei, Xuejing [1]

Kuo, C-C Jay [1]

Affiliations:

[1] Univ Southern Calif, Los Angeles, CA 90007 USA

[2] Google AI Percept, Mountain View, CA 94043 USA

Source:

COMPUTER VISION - ECCV 2018, PT III | 2018 / Vol. 11207

Keywords:

Video object segmentation; Bilateral networks; Instance embeddings

DOI:

10.1007/978-3-030-01219-9_13

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this work, we study the unsupervised video object segmentation problem, where moving objects are segmented without prior knowledge of these objects. First, we propose a motion-based bilateral network to estimate the background based on the motion pattern of non-object regions. The bilateral network reduces false positive regions by accurately identifying background objects. Then, we integrate the background estimate from the bilateral network with instance embeddings into a graph, which allows multi-frame reasoning with graph edges linking pixels from different frames. We classify graph nodes by defining and minimizing a cost function, and segment the video frames based on the node labels. The proposed method outperforms previous state-of-the-art unsupervised video object segmentation methods on the DAVIS 2016 and FBMS-59 datasets.
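The graph-labeling step described in the abstract can be illustrated with a toy sketch. The abstract does not give the cost function or the optimizer, so everything here is an assumption made for illustration: each node carries a background probability (standing in for the bilateral network output), each edge is weighted by instance-embedding similarity, the cost is a simple unary-plus-pairwise sum, and the minimizer is iterated conditional modes, a basic swapped-in technique rather than the authors' method. The names `embedding_similarity`, `label_nodes`, and `smoothness` are hypothetical.

```python
import math


def embedding_similarity(a, b):
    """Edge weight from two embedding vectors: exp(-squared Euclidean distance)."""
    return math.exp(-sum((x - y) ** 2 for x, y in zip(a, b)))


def label_nodes(bg_prob, embeddings, edges, smoothness=1.0, max_sweeps=10):
    """Label each node 0 (background) or 1 (object).

    Assumed cost (illustrative, not taken from the paper):
        sum_i unary_i(label_i)
        + smoothness * sum_(i,j) w_ij * [label_i != label_j]
    minimized greedily with iterated conditional modes (ICM).
    """
    n = len(bg_prob)
    neighbors = [[] for _ in range(n)]
    for i, j in edges:
        w = embedding_similarity(embeddings[i], embeddings[j])
        neighbors[i].append((j, w))
        neighbors[j].append((i, w))

    # Start from the labeling suggested by the background estimate alone.
    labels = [0 if p >= 0.5 else 1 for p in bg_prob]

    def local_cost(i, label):
        # Unary term: penalize disagreeing with the background estimate.
        unary = (1.0 - bg_prob[i]) if label == 0 else bg_prob[i]
        # Pairwise term: penalize differing from similar-looking neighbors.
        pairwise = sum(w for j, w in neighbors[i] if labels[j] != label)
        return unary + smoothness * pairwise

    for _ in range(max_sweeps):
        changed = False
        for i in range(n):
            best = min((0, 1), key=lambda lab: local_cost(i, lab))
            if best != labels[i]:
                labels[i] = best
                changed = True
        if not changed:
            break
    return labels


# Nodes 0-2 belong to one frame, nodes 3-5 to the next; edges (0,3), (1,4),
# and (2,5) link the two frames.
bg_prob = [0.9, 0.2, 0.55, 0.9, 0.1, 0.1]
embeddings = [(0.0, 0.0), (1.0, 1.0), (1.0, 0.9),
              (0.1, 0.0), (1.0, 1.0), (0.9, 1.0)]
edges = [(0, 1), (1, 2), (0, 3), (1, 4), (2, 5), (3, 4), (4, 5)]
print(label_nodes(bg_prob, embeddings, edges))
```

In this toy input, node 2 is slightly more likely to be background by its unary term alone, but its embedding is close to those of its object-labeled neighbors, including one in the other frame, so the pairwise term pulls it to the object label. This mirrors, in a very reduced form, the idea of combining a background estimate with embeddings across frames.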


Pages: 215-231

Page count: 17

