Machine Learning for Video Action Recognition: a Computer Vision Approach

被引:0
作者
Labayen, Mikel [1 ]
Aginako, Naiara [1 ]
Sierra, Basilio [1 ]
Olaizola, Igor G. [2 ]
Florez, Julian [2 ]
机构
[1] Univ Basque Country, Comp Sci & Artificial Intelligence Dept, Donostia San Sebastian, Spain
[2] Vicomtech, Data Intelligence Energy & Ind Proc, Donostia San Sebastian, Spain
来源
2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS) | 2018年
关键词
Action Recognition; Computer Vision; Image and Video Processing; Machine Learning; TRACE TRANSFORM; SELECTION; CLASSIFIER;
D O I
10.1109/SITIS.2018.00110
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The automatic detection of video action is still a challenging research task. In this paper, we consider a first atomic approach and its empirical evaluation to classify a single action in a short video sequence based on DITEC image characterization method. The presented method combines four different concepts: global image descriptors, image transformation algorithms, Machine Learning paradigms for supervised classification and Feature Subset Selection (FSS) techniques. Using DITEC descriptors, which are based on the Trace Transform, the information contained in a video is handled as an image. This allows us to apply Image Processing solutions for the analysis of the video, more concretely, of the occurring action. Key features are extracted to nourish Machine Learning classifiers in order to predict the action. The final step is to use a Feature Subset Selection (FSS) standard method to select the most accurate attributes for the identification of the action. The idea of understanding videos as images widens the possibilities for the analysis of temporal behaviour of actions within a video.
引用
收藏
页码:683 / 690
页数:8
相关论文
共 35 条
[11]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[12]   Exploring trace transform for robust human action recognition [J].
Goudelis, Georgios ;
Karpouzis, Konstantinos ;
Kollias, Stefanos .
PATTERN RECOGNITION, 2013, 46 (12) :3238-3248
[13]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[14]  
Igor G., 2013, VISAPP 2013 INT C CO, V1
[15]   Feature Subset Selection by Bayesian network-based optimization [J].
Inza, I ;
Larrañaga, P ;
Etxeberria, R ;
Sierra, B .
ARTIFICIAL INTELLIGENCE, 2000, 123 (1-2) :157-184
[16]   The trace transform and its applications [J].
Kadyrov, A ;
Petrou, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) :811-828
[17]   Large-scale Video Classification with Convolutional Neural Networks [J].
Karpathy, Andrej ;
Toderici, George ;
Shetty, Sanketh ;
Leung, Thomas ;
Sukthankar, Rahul ;
Fei-Fei, Li .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1725-1732
[18]   Dynamic selection of the best base classifier in One versus One [J].
Mendialdua, I. ;
Martinez-Otzeta, J. M. ;
Rodriguez-Rodriguez, I. ;
Ruiz-Vazquez, T. ;
Sierra, B. .
KNOWLEDGE-BASED SYSTEMS, 2015, 85 :298-306
[19]  
MITCHELL T, 1989, ANNU REV COMPUT SCI, V4, P417
[20]   Speech emotion recognition using hidden Markov models [J].
Nwe, TL ;
Foo, SW ;
De Silva, LC .
SPEECH COMMUNICATION, 2003, 41 (04) :603-623