Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

被引:107
作者
Li, Jia [2 ,3 ]
Tian, Yonghong [1 ]
Huang, Tiejun [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
关键词
Visual saliency; Probabilistic framework; Visual search tasks; Multi-task learning; ATTENTION; MODEL;
D O I
10.1007/s11263-010-0354-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulus-driven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multi-scale wavelet decomposition and unbiased feature competition; while a task-related component simulates the high-level processes to bias the competition of the input features. Different from existing approaches, we propose a multi-task learning algorithm to learn the task-related "stimulus-saliency" mapping functions for each scene. The algorithm also learns various fusion strategies, which are used to integrate the stimulus-driven and task-related components to obtain the visual saliency. Extensive experiments were carried out on two public eye-fixation datasets and one regional saliency dataset. Experimental results show that our approach outperforms eight state-of-the-art approaches remarkably.
引用
收藏
页码:150 / 165
页数:16
相关论文
共 39 条
[1]  
[Anonymous], 2007, Computer Vision and Pattern Recognition (CVPR), IEEE Conference on
[2]  
[Anonymous], COLL RES COMP NEUR A
[3]  
[Anonymous], 2006, Advances in Neural Information Processing Systems
[4]  
Argyriou Andreas, 2007, Advances in Neural Information Processing Systems, P41
[5]  
Cerf M., 2008, Advances in Neural Information Processing Systems, V20, P241
[6]   A novel cross-diamond search algorithm for fast block motion estimation [J].
Cheung, CH ;
Po, LM .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (12) :1168-1177
[7]  
Chun Marvin M., 2005, P246, DOI 10.1016/B978-012375731-9/50044-6
[8]  
Evgeniou T, 2005, J MACH LEARN RES, V6, P615
[9]  
Frith Chris, 2005, P105, DOI 10.1016/B978-012375731-9/50022-7
[10]  
Guo CL, 2008, PROC CVPR IEEE, P2908