Kernel regression in mixed feature spaces for spatio-temporal saliency detection

被引:24
作者
Li, Yansheng [1 ]
Tan, Yihua [1 ]
Yu, Jin-Gang [1 ]
Qi, Shengxiang [1 ]
Tian, Jinwen [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Wuhan 430074, Peoples R China
关键词
Spatio-temporal saliency; Kernel regression; Mixed feature spaces; Hybrid fusion strategy; MOTION; IMAGE; VIDEO; MODEL; V1;
D O I
10.1016/j.cviu.2015.01.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatio-temporal saliency detection has attracted lots of research interests due to its competitive performance on wide multimedia applications. For spatio-temporal saliency detection, existing bottom-up algorithms often over-simplify the fusion strategy, which results in the inferior performance than the human vision system. In this paper, a novel bottom-up spatio-temporal saliency model is proposed to improve the accuracy of attentional region estimation in videos through fully exploiting the merit of fusion. In order to represent the space constructed by several types of features such as location, appearance and temporal cues extracted from video, kernel regression in mixed feature spaces (KR-MFS) including three approximation entity-models is proposed. Using KR-MFS, a hybrid fusion strategy which considers the combination of spatial and temporal saliency of each individual unit and incorporates the impacts from the neighboring units is presented and embedded into the spatio-temporal saliency model. The proposed model has been evaluated on the publicly available dataset. Experimental results show that the proposed spatio-temporal saliency model can achieve better performance than the state-of-the-art approaches. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:126 / 140
页数:15
相关论文
共 56 条
[21]  
Hou X., 2008, ADV NEURAL INF PROCE, V21, P681
[22]  
Itti L, 2005, PROC CVPR IEEE, P631
[23]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[24]   Computational modelling of visual attention [J].
Itti, L ;
Koch, C .
NATURE REVIEWS NEUROSCIENCE, 2001, 2 (03) :194-203
[25]   Emergence of complex cell properties by learning to generalize in natural scenes [J].
Karklin, Yan ;
Lewicki, Michael S. .
NATURE, 2009, 457 (7225) :83-U85
[26]  
Kienzle W, 2007, LECT NOTES COMPUT SC, V4713, P405
[27]   Visual saliency in noisy images [J].
Kim, Chelhwon ;
Milanfar, Peyman .
JOURNAL OF VISION, 2013, 13 (04)
[28]   Spatiotemporal Saliency Detection and Its Applications in Static and Dynamic Scenes [J].
Kim, Wonjun ;
Jung, Chanho ;
Kim, Changick .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) :446-456
[29]   Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video [J].
Li, Jia ;
Tian, Yonghong ;
Huang, Tiejun ;
Gao, Wen .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 90 (02) :150-165
[30]  
Li J, 2009, IEEE INT CON MULTI, P442, DOI 10.1109/ICME.2009.5202529