Human action recognition by means of subtensor projections and dense trajectories

被引:36
作者
Maria Carmona, Josep [1 ]
Climent, Joan [1 ]
机构
[1] Barcelona Tech UPC, Automat Control & Comp Eng Dept, C Jordi Girona 1-3, Barcelona 08034, Spain
关键词
Action recognition; Subtensors; Dense trajectories; Keypoint descriptors; Temporal template; FISHER VECTOR; FEATURES; CLASSIFICATION; INFORMATION;
D O I
10.1016/j.patcog.2018.04.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years, most human action recognition works have used dense trajectories features, to achieve state-of-the-art results. Histograms of Oriented Gradients (HOG), Histogram of Optical Flow (HOF) and Motion Boundary Histograms (MBH) features are extracted from regions and being tracked across the frames. The goal of this paper is to improve the performance obtained by means of Improved Dense Trajectories (IDTs), adding new features based on temporal templates. We construct these templates considering a video sequence as a third-order tensor and computing three different projections. We use several functions for projecting the fibers from the video sequences, and combined them by means of sum pooling. As a first contribution of our work, we present in detail the method based on tensor projections. First, we have assessed the results obtained using only template based action recognition. Next, in order to achieve state-of-art recognition rates, we have fused our features with those of IDTs. This is the second contribution of the article. Experiments on four different public datasets have shown that this technique improves IDTs performance and that the results outperform the ones obtained by most of the state-of-the-art techniques for action recognition. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:443 / 455
页数:13
相关论文
共 58 条
[1]   Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition [J].
Almaev, Timur R. ;
Valstar, Michel F. .
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, :356-361
[2]  
[Anonymous], PAMI
[3]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[4]  
[Anonymous], IEEE I CONF COMP VIS
[5]  
[Anonymous], COMPUT RES REPOS
[6]  
[Anonymous], P 13 INT WORKSH CONT
[7]  
[Anonymous], P INT C COMP VIS ICC
[8]  
[Anonymous], COMPUT RES REPOS
[9]   Dynamic Image Networks for Action Recognition [J].
Bilen, Hakan ;
Fernando, Basura ;
Gavves, Efstratios ;
Vedaldi, Andrea ;
Gould, Stephen .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3034-3042
[10]   The recognition of human movement using temporal templates [J].
Bobick, AF ;
Davis, JW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267