Semantic Co-segmentation in Videos

被引:47
作者
Tsai, Yi-Hsuan [1 ]
Zhong, Guangyu [1 ,2 ]
Yang, Ming-Hsuan [1 ]
机构
[1] UC Merced, Merced, CA 95340 USA
[2] Dalian Univ Technol, Dalian, Peoples R China
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
基金
美国国家科学基金会;
关键词
D O I
10.1007/978-3-319-46493-0_46
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering and segmenting objects in videos is a challenging task due to large variations of objects in appearances, deformed shapes and cluttered backgrounds. In this paper, we propose to segment objects and understand their visual semantics from a collection of videos that link to each other, which we refer to as semantic co-segmentation. Without any prior knowledge on videos, we first extract semantic objects and utilize a tracking-based approach to generate multiple object-like tracklets across the video. Each tracklet maintains temporally connected segments and is associated with a predicted category. To exploit rich information from other videos, we collect tracklets that are assigned to the same category from all videos, and co-select tracklets that belong to true objects by solving a submodular function. This function accounts for object properties such as appearances, shapes and motions, and hence facilitates the co-segmentation process. Experiments on three video object segmentation datasets show that the proposed algorithm performs favorably against the other state-of-the-art methods.
引用
收藏
页码:760 / 775
页数:16
相关论文
共 36 条
  • [1] [Anonymous], 2015, CVPR
  • [2] [Anonymous], 2014, IEEE C COMP VIS PATT
  • [3] [Anonymous], 2013, IEEE C COMP VIS PATT
  • [4] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298724
  • [5] An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision
    Boykov, Y
    Kolmogorov, V
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1124 - 1137
  • [6] C Rubio J., 2012, CVPR
  • [7] Predicting Multiple Attributes via Relative Multi-task Learning
    Chen, Lin
    Zhang, Qiang
    Li, Baoxin
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1027 - 1034
  • [8] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [9] Godec M, 2011, IEEE I CONF COMP VIS, P81, DOI 10.1109/ICCV.2011.6126228
  • [10] Consistent Foreground Co-segmentation
    Guo, Jiaming
    Cheong, Loong-Fah
    Tan, Robby T.
    Zhou, Steven Zhiying
    [J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 241 - 257