Spatiotemporal Features for Action Recognition and Salient Event Detection

Authors
Konstantinos Rapantzikos
Yannis Avrithis
Stefanos Kollias
Affiliation
[1] National Technical University of Athens
Source
Cognitive Computation | 2011 / Vol. 3
Keywords
Spatiotemporal visual saliency; Volumetric representation; Action recognition; Salient event detection
DOI
Not available
Abstract
Although the mechanisms of human visual understanding remain only partially understood, computational models inspired by existing knowledge of human vision have emerged and been applied to several fields. In this paper, we propose a novel method to compute visual saliency from video sequences by taking into account the actual spatiotemporal nature of the video. The visual input is represented by a volume in space-time and decomposed into a set of feature volumes at multiple resolutions. Feature competition, implemented by constrained minimization, is used to produce a saliency distribution of the input. The proposed constraints are inspired by and associated with the Gestalt laws. The approach makes several contributions: it extends existing visual feature models to a volumetric representation, allows competition across features, scales and voxels, and formulates constraints in accordance with perceptual principles. The resulting saliency volume is used to detect prominent spatiotemporal regions and is consequently applied to action recognition and perceptually salient event detection in video sequences. Comparisons against established methods on public datasets are given and reveal the potential of the proposed model. The experiments include three action recognition scenarios and salient temporal segment detection in a movie database annotated by humans.
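The volumetric representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the constrained-minimization feature competition is replaced here by a plain center-surround contrast between adjacent pyramid levels, and only a single intensity feature is used. All function names (`downsample2`, `upsample_to`, `pyramid`, `saliency_volume`) and the `levels` parameter are hypothetical, chosen for illustration.

```python
import numpy as np

def downsample2(vol):
    """Halve each axis of a (T, H, W) volume by 2x2x2 block averaging."""
    t, h, w = (s - s % 2 for s in vol.shape)  # crop to even sizes
    v = vol[:t, :h, :w]
    return v.reshape(t // 2, 2, h // 2, 2, w // 2, 2).mean(axis=(1, 3, 5))

def upsample_to(vol, shape):
    """Nearest-neighbour upsampling of a volume to a target shape."""
    idx = [np.arange(n) * s // n for n, s in zip(shape, vol.shape)]
    return vol[np.ix_(*idx)]

def pyramid(vol, levels=3):
    """Multiresolution stack of volumes, finest first."""
    out = [vol.astype(float)]
    for _ in range(levels - 1):
        out.append(downsample2(out[-1]))
    return out

def saliency_volume(frames, levels=3):
    """Assumed stand-in for feature competition: center-surround contrast
    between adjacent scales, normalized per scale and averaged."""
    vol = np.stack(frames).astype(float)  # space-time volume, shape (T, H, W)
    pyr = pyramid(vol, levels)
    sal = np.zeros_like(vol)
    for fine, coarse in zip(pyr[:-1], pyr[1:]):
        contrast = np.abs(fine - upsample_to(coarse, fine.shape))
        peak = contrast.max()
        if peak > 0:
            contrast = contrast / peak  # scale each level to [0, 1]
        sal += upsample_to(contrast, vol.shape)
    return sal / max(levels - 1, 1)

# Usage: 8 dark frames of 16x16 pixels with a brief bright blob.
frames = [np.zeros((16, 16)) for _ in range(8)]
for t in (4, 5):
    frames[t][8:10, 8:10] = 1.0
sal = saliency_volume(frames)
```

In this toy input the blob is a short-lived, spatially compact event, so the voxels it occupies receive higher values than the rest of the volume. A fuller treatment along the lines of the abstract would add further feature volumes and let them compete across features, scales and voxels.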
Pages: 167-184
Page count: 17