Kernel regression in mixed feature spaces for spatio-temporal saliency detection

被引:24
作者
Li, Yansheng [1 ]
Tan, Yihua [1 ]
Yu, Jin-Gang [1 ]
Qi, Shengxiang [1 ]
Tian, Jinwen [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Wuhan 430074, Peoples R China
关键词
Spatio-temporal saliency; Kernel regression; Mixed feature spaces; Hybrid fusion strategy; MOTION; IMAGE; VIDEO; MODEL; V1;
D O I
10.1016/j.cviu.2015.01.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatio-temporal saliency detection has attracted lots of research interests due to its competitive performance on wide multimedia applications. For spatio-temporal saliency detection, existing bottom-up algorithms often over-simplify the fusion strategy, which results in the inferior performance than the human vision system. In this paper, a novel bottom-up spatio-temporal saliency model is proposed to improve the accuracy of attentional region estimation in videos through fully exploiting the merit of fusion. In order to represent the space constructed by several types of features such as location, appearance and temporal cues extracted from video, kernel regression in mixed feature spaces (KR-MFS) including three approximation entity-models is proposed. Using KR-MFS, a hybrid fusion strategy which considers the combination of spatial and temporal saliency of each individual unit and incorporates the impacts from the neighboring units is presented and embedded into the spatio-temporal saliency model. The proposed model has been evaluated on the publicly available dataset. Experimental results show that the proposed spatio-temporal saliency model can achieve better performance than the state-of-the-art approaches. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:126 / 140
页数:15
相关论文
共 56 条
[1]   SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].
Achanta, Radhakrishna ;
Shaji, Appu ;
Smith, Kevin ;
Lucchi, Aurelien ;
Fua, Pascal ;
Suesstrunk, Sabine .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281
[2]  
[Anonymous], 2010, Proceedings of the Asian Conference on Computer Vision
[3]  
[Anonymous], 2009, Beyond pixels: exploring new representations and applications for motion analysis
[4]  
[Anonymous], 2013, 27 AAAI C ART INT
[5]   State-of-the-Art in Visual Attention Modeling [J].
Borji, Ali ;
Itti, Laurent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :185-207
[6]   Structure and function of visual area MT [J].
Born, RT ;
Bradley, DC .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :157-189
[7]   VARIABLE KERNEL ESTIMATES OF MULTIVARIATE DENSITIES [J].
BREIMAN, L ;
MEISEL, W ;
PURCELL, E .
TECHNOMETRICS, 1977, 19 (02) :135-144
[8]   Lucas/Kanade meets Horn/Schunck: Combining local and global optic flow methods [J].
Bruhn A. ;
Weickert J. ;
Schnörr C. .
International Journal of Computer Vision, 2005, 61 (3) :1-21
[9]  
Bugeau A., 2007, 6286 INRIA
[10]   Detection and segmentation of moving objects in complex scenes [J].
Bugeau, Aurelie ;
Perez, Patrick .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (04) :459-476