Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement

被引:333
作者
Wang, Wenguan [1 ]
Shen, Jianbing [1 ]
Shao, Ling [2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[2] Northumbria Univ, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
基金
中国国家自然科学基金;
关键词
Video saliency; energy optimization; gradient flow field; spatiotemporal saliency energy; VISUAL-ATTENTION; DETECTION MODEL; ADAPTIVE IMAGE; OBJECT; SEGMENTATION; STATISTICS;
D O I
10.1109/TIP.2015.2460013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel spatiotemporal saliency detection method to estimate salient regions in videos based on the gradient flow field and energy optimization. The proposed gradient flow field incorporates two distinctive features: 1) intra-frame boundary information and 2) inter-frame motion information together for indicating the salient regions. Based on the effective utilization of both intra-frame and inter-frame information in the gradient flow field, our algorithm is robust enough to estimate the object and background in complex scenes with various motion patterns and appearances. Then, we introduce local as well as global contrast saliency measures using the foreground and background information estimated from the gradient flow field. These enhanced contrast saliency cues uniformly highlight an entire object. We further propose a new energy function to encourage the spatiotemporal consistency of the output saliency maps, which is seldom explored in previous video saliency methods. The experimental results show that the proposed algorithm outperforms state-of-the-art video saliency detection methods.
引用
收藏
页码:4185 / 4196
页数:12
相关论文
共 51 条
  • [1] Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
  • [2] [Anonymous], 2010, EPFLREPORT149300
  • [3] [Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267
  • [4] [Anonymous], 2006, VOCUS: a visual attention system for object detection and goal-directed search
  • [5] Borji A, 2012, PROC CVPR IEEE, P470, DOI 10.1109/CVPR.2012.6247710
  • [6] Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation
    Brox, Thomas
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) : 500 - 513
  • [7] Brox T, 2010, LECT NOTES COMPUT SC, V6315, P282, DOI 10.1007/978-3-642-15555-0_21
  • [8] Cheng G., 2008, Transportation Research Board 87th Annual Meeting Compendium of Papers, P1
  • [9] Efficient Salient Region Detection with Soft Image Abstraction
    Cheng, Ming-Ming
    Warrell, Jonathan
    Lin, Wen-Yan
    Zheng, Shuai
    Vineet, Vibhav
    Crook, Nigel
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1529 - 1536
  • [10] Video adaptation for small display based on content recomposition
    Cheng, Wen-Huang
    Wang, Chia-Wei
    Wu, Ja-Ling
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (01) : 43 - 58