Machine Learning for Video Action Recognition: a Computer Vision Approach

Cited by: 0
Authors
Labayen, Mikel [1]
Aginako, Naiara [1]
Sierra, Basilio [1]
Olaizola, Igor G. [2]
Florez, Julian [2]
Affiliations
[1] Univ Basque Country, Comp Sci & Artificial Intelligence Dept, Donostia San Sebastian, Spain
[2] Vicomtech, Data Intelligence Energy & Ind Proc, Donostia San Sebastian, Spain
Source
2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS) | 2018
Keywords
Action Recognition; Computer Vision; Image and Video Processing; Machine Learning; Trace Transform; Selection; Classifier
DOI
10.1109/SITIS.2018.00110
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Automatic detection of actions in video remains a challenging research task. In this paper, we present a first atomic approach, together with its empirical evaluation, for classifying a single action in a short video sequence based on the DITEC image characterization method. The method combines four concepts: global image descriptors, image transformation algorithms, Machine Learning paradigms for supervised classification, and Feature Subset Selection (FSS) techniques. Using DITEC descriptors, which are based on the Trace Transform, the information contained in a video is handled as an image. This allows us to apply Image Processing solutions to the analysis of the video and, more concretely, of the action occurring in it. Key features are extracted and fed to Machine Learning classifiers in order to predict the action. The final step applies a standard FSS method to select the most discriminative attributes for identifying the action. Treating videos as images widens the possibilities for analysing the temporal behaviour of actions within a video.
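The last two stages named in the abstract (supervised classification followed by Feature Subset Selection) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which classifier or FSS method the paper uses, so the sketch assumes a 1-nearest-neighbour classifier and greedy forward selection scored by leave-one-out accuracy. DITEC descriptors are not computed here; synthetic feature vectors stand in for them, and all function names (`loo_accuracy`, `forward_selection`, `make_toy_data`) are hypothetical.

```python
import math
import random


def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a 1-NN classifier using only `features`."""
    correct = 0
    for i, xi in enumerate(X):
        best_j, best_d = None, math.inf
        for j, xj in enumerate(X):
            if i == j:
                continue  # the held-out sample may not vote for itself
            d = sum((xi[f] - xj[f]) ** 2 for f in features)
            if d < best_d:
                best_j, best_d = j, d
        correct += y[best_j] == y[i]
    return correct / len(X)


def forward_selection(X, y):
    """Greedy wrapper FSS: add the feature that most improves accuracy,
    and stop as soon as no remaining feature gives an improvement."""
    n_features = len(X[0])
    selected, best_score = [], 0.0
    while True:
        candidates = [f for f in range(n_features) if f not in selected]
        if not candidates:
            break
        scored = [(loo_accuracy(X, y, selected + [f]), f) for f in candidates]
        # Ties are broken in favour of the lowest feature index.
        score, f = max(scored, key=lambda t: (t[0], -t[1]))
        if score <= best_score:
            break
        selected.append(f)
        best_score = score
    return selected, best_score


def make_toy_data(n_per_class=20, seed=0):
    """Synthetic stand-in for descriptor vectors (assumption, not DITEC):
    feature 2 separates the two classes, the other four are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            informative = label * 4.0 + rng.gauss(0, 0.5)
            noise = [rng.gauss(0, 3.0) for _ in range(4)]
            X.append([noise[0], noise[1], informative, noise[2], noise[3]])
            y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    selected, score = forward_selection(X, y)
    print("selected features:", selected)
    print("leave-one-out accuracy:", score)
    print("accuracy with all features:", loo_accuracy(X, y, list(range(5))))
```

A wrapper of this kind scores each candidate subset with the classifier itself, which is costly but ties the selection directly to classification accuracy. A filter method or a different classifier could be substituted without changing the overall structure.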
Pages: 683-690
Number of pages: 8