Semantic Co-segmentation in Videos

被引：47

作者：

Tsai, Yi-Hsuan ^{[1
]}

Zhong, Guangyu ^{[1
,2
]}

Yang, Ming-Hsuan ^{[1
]}

机构：

[1] UC Merced, Merced, CA 95340 USA

[2] Dalian Univ Technol, Dalian, Peoples R China

来源：

COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷

基金：

美国国家科学基金会;

关键词：

D O I：

10.1007/978-3-319-46493-0_46

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Discovering and segmenting objects in videos is a challenging task due to large variations of objects in appearances, deformed shapes and cluttered backgrounds. In this paper, we propose to segment objects and understand their visual semantics from a collection of videos that link to each other, which we refer to as semantic co-segmentation. Without any prior knowledge on videos, we first extract semantic objects and utilize a tracking-based approach to generate multiple object-like tracklets across the video. Each tracklet maintains temporally connected segments and is associated with a predicted category. To exploit rich information from other videos, we collect tracklets that are assigned to the same category from all videos, and co-select tracklets that belong to true objects by solving a submodular function. This function accounts for object properties such as appearances, shapes and motions, and hence facilitates the co-segmentation process. Experiments on three video object segmentation datasets show that the proposed algorithm performs favorably against the other state-of-the-art methods.

引用

页码：760 / 775

页数：16

共 36 条

[1] [Anonymous], 2015, CVPR
[2] [Anonymous], 2014, IEEE C COMP VIS PATT
[3] [Anonymous], 2013, IEEE C COMP VIS PATT
[4] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298724
[5] An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision
Boykov, Y
Kolmogorov, V
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1124 - 1137
[6] C Rubio J., 2012, CVPR
[7] Predicting Multiple Attributes via Relative Multi-task Learning
Chen, Lin
Zhang, Qiang
Li, Baoxin
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1027 - 1034
[8] The PASCAL Visual Object Classes Challenge: A Retrospective
Everingham, Mark
Eslami, S. M. Ali
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
[9] Godec M, 2011, IEEE I CONF COMP VIS, P81, DOI 10.1109/ICCV.2011.6126228
[10] Consistent Foreground Co-segmentation
Guo, Jiaming
Cheong, Loong-Fah
Tan, Robby T.
Zhou, Steven Zhiying
[J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 241 - 257

← 1 2 3 4 →