Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

被引：107

作者：

Li, Jia ^{[2
,3
]}

Tian, Yonghong ^{[1
]}

Huang, Tiejun ^{[1
]}

Gao, Wen ^{[1
]}

机构：

[1] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China

[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China

[3] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2010年 / 90卷 / 02期

关键词：

Visual saliency; Probabilistic framework; Visual search tasks; Multi-task learning; ATTENTION; MODEL;

D O I：

10.1007/s11263-010-0354-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulus-driven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multi-scale wavelet decomposition and unbiased feature competition; while a task-related component simulates the high-level processes to bias the competition of the input features. Different from existing approaches, we propose a multi-task learning algorithm to learn the task-related "stimulus-saliency" mapping functions for each scene. The algorithm also learns various fusion strategies, which are used to integrate the stimulus-driven and task-related components to obtain the visual saliency. Extensive experiments were carried out on two public eye-fixation datasets and one regional saliency dataset. Experimental results show that our approach outperforms eight state-of-the-art approaches remarkably.

引用

页码：150 / 165

页数：16

共 39 条

[1]

[Anonymous], 2007, Computer Vision and Pattern Recognition (CVPR), IEEE Conference on

[2]

[Anonymous], COLL RES COMP NEUR A

[3]

[Anonymous], 2006, Advances in Neural Information Processing Systems

[4]

Argyriou Andreas, 2007, Advances in Neural Information Processing Systems, P41

[5]

Cerf M., 2008, Advances in Neural Information Processing Systems, V20, P241

[6] A novel cross-diamond search algorithm for fast block motion estimation [J].

Cheung, CH ;

Po, LM .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (12) :1168-1177

[7]

Chun Marvin M., 2005, P246, DOI 10.1016/B978-012375731-9/50044-6

[8]

Evgeniou T, 2005, J MACH LEARN RES, V6, P615

[9]

Frith Chris, 2005, P105, DOI 10.1016/B978-012375731-9/50022-7

[10]

Guo CL, 2008, PROC CVPR IEEE, P2908

← 1 2 3 4 →