Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement

被引：344

作者：

Wang, Wenguan ^{[1
]}

Shen, Jianbing ^{[1
]}

Shao, Ling ^{[2
]}

机构：

[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China

[2] Northumbria Univ, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2015年 / 24卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Video saliency; energy optimization; gradient flow field; spatiotemporal saliency energy; VISUAL-ATTENTION; DETECTION MODEL; ADAPTIVE IMAGE; OBJECT; SEGMENTATION; STATISTICS;

D O I：

10.1109/TIP.2015.2460013

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel spatiotemporal saliency detection method to estimate salient regions in videos based on the gradient flow field and energy optimization. The proposed gradient flow field incorporates two distinctive features: 1) intra-frame boundary information and 2) inter-frame motion information together for indicating the salient regions. Based on the effective utilization of both intra-frame and inter-frame information in the gradient flow field, our algorithm is robust enough to estimate the object and background in complex scenes with various motion patterns and appearances. Then, we introduce local as well as global contrast saliency measures using the foreground and background information estimated from the gradient flow field. These enhanced contrast saliency cues uniformly highlight an entire object. We further propose a new energy function to encourage the spatiotemporal consistency of the output saliency maps, which is seldom explored in previous video saliency methods. The experimental results show that the proposed algorithm outperforms state-of-the-art video saliency detection methods.

引用

页码：4185 / 4196

页数：12

共 51 条

[1]

Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596

[2]

[Anonymous], 2010, EPFLREPORT149300

[3]

[Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267

[4]

[Anonymous], 2006, VOCUS: a visual attention system for object detection and goal-directed search

[5]

Borji A, 2012, PROC CVPR IEEE, P470, DOI 10.1109/CVPR.2012.6247710

[6] Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation [J].

Brox, Thomas ;

Malik, Jitendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :500-513

[7]

Brox T, 2010, LECT NOTES COMPUT SC, V6315, P282, DOI 10.1007/978-3-642-15555-0_21

[8]

Cheng G., 2008, Transportation Research Board 87th Annual Meeting Compendium of Papers, P1

[9] Efficient Salient Region Detection with Soft Image Abstraction [J].

Cheng, Ming-Ming ;

Warrell, Jonathan ;

Lin, Wen-Yan ;

Zheng, Shuai ;

Vineet, Vibhav ;

Crook, Nigel .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1529-1536

[10] Video adaptation for small display based on content recomposition [J].

Cheng, Wen-Huang ;

Wang, Chia-Wei ;

Wu, Ja-Ling .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (01) :43-58

← 1 2 3 4 5 6 →