Efficient large-scale action recognition in videos using extreme learning machines

Cited by: 34
Authors
Varol, Gul [1]
Salah, Albert Ali [1]
Affiliation
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
Keywords
Action recognition; Extreme learning machine; Fisher vector; Multimedia mining; Recognizing human actions
DOI
10.1016/j.eswa.2015.06.013
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a novel and efficient system for large-scale action recognition from realistic video clips. Our approach combines several recent advances in this area. We use improved dense trajectory features in combination with Fisher vector encoding, and perform learning and classification with extreme learning machine classifiers. The resulting system is a fast and accurate alternative to more traditional action classification approaches such as bag of words and support vector machines. Additionally, we use mid-level features that encode information about the presence of humans in the videos, as well as color distributions. We extensively evaluate each step of our pipeline in a comparative manner, and report results on the recently published THUMOS 2014 benchmark, which was introduced as a challenge dataset with temporally untrimmed videos and 101 action classes. We achieve 63.37% mean average precision using the challenge protocol (i.e., sequestered test labels and limited system submissions), ranking third among eleven participants. The results show that it is possible to obtain high accuracy efficiently with extreme learning machines, without using the extensively trained and computationally heavy deep neural networks that the top-performing systems of the challenge incorporated. © 2015 Elsevier Ltd. All rights reserved.
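The classification stage named in the abstract can be illustrated with a minimal sketch of a generic extreme learning machine: a hidden layer with fixed random weights, followed by output weights solved in closed form by regularized least squares. Everything below is an assumption made for illustration rather than the authors' configuration: the class name `ELMClassifier`, the sigmoid activation, the hidden-layer size, the regularization constant `reg`, and the synthetic 2-D points that stand in for Fisher-vector video descriptors.

```python
import numpy as np


class ELMClassifier:
    """Illustrative single-hidden-layer extreme learning machine (not the paper's code)."""

    def __init__(self, n_hidden=100, reg=10.0, seed=0):
        self.n_hidden = n_hidden  # number of random hidden units (assumed value)
        self.reg = reg            # regularization constant C (assumed value)
        self.seed = seed

    def _hidden(self, X):
        # Sigmoid of a fixed random projection; W and b are never trained.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        self.classes_ = np.unique(y)
        self.W = rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # One-vs-rest targets: +1 for the true class, -1 for the others.
        T = np.where(y[:, None] == self.classes_[None, :], 1.0, -1.0)
        # Closed-form ridge solution: beta = (I / C + H^T H)^{-1} H^T T
        A = H.T @ H + np.eye(self.n_hidden) / self.reg
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def predict(self, X):
        scores = self._hidden(X) @ self.beta
        return self.classes_[np.argmax(scores, axis=1)]


def make_blobs(n_per_class, rng):
    # Synthetic stand-in data: three well-separated 2-D Gaussian clusters.
    centers = np.array([[0.0, 4.0], [-3.5, -2.0], [3.5, -2.0]])
    X = np.vstack([c + rng.normal(size=(n_per_class, 2)) for c in centers])
    y = np.repeat(np.arange(len(centers)), n_per_class)
    return X, y


def demo_accuracy():
    rng = np.random.default_rng(42)
    X_train, y_train = make_blobs(200, rng)
    X_test, y_test = make_blobs(100, rng)
    model = ELMClassifier().fit(X_train, y_train)
    return float(np.mean(model.predict(X_test) == y_test))


if __name__ == "__main__":
    print(f"held-out accuracy on toy data: {demo_accuracy():.3f}")
```

Because only the output weights are fitted, and a single linear solve does that, training cost is dominated by one matrix product and one `n_hidden` by `n_hidden` system. This is the kind of efficiency the abstract contrasts with iteratively trained deep networks.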
Pages: 8274-8282
Page count: 9