Evolutionary optimization of video event classification

被引:2
作者
Tahayna, B. [1 ]
Belkhatir, M. [2 ]
Alhashmi, S. M. [1 ]
O'Daniel, T. [1 ]
机构
[1] Monash Univ, Sch IT, Bandar Sunway 46150, Malaysia
[2] Univ Lyon 1, Lyon Inst Technol, F-69365 Lyon, France
关键词
video semantic events; classification; genetic algorithms; support vector machines; optimization; motion; PERFORMANCE; ALGORITHM;
D O I
10.1080/00207160.2011.629654
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Detection and classification of semantic events in video documents have been major tasks in the domain of automatic video analysis. The multimodality of video data brings about challenging issues and the effectiveness of its automatic semantic processing has been hampered by the video semantic gap, that is, the gap between low-level visual and spatio-temporal features used as a digital representation of video documents and their semantic interpretation. Traditionally, these low-level features are concatenated in high-dimensional spaces and classified as high-level semantic events through computational learning methods. Support vector machines (SVMs), widely used in the literature, have often shown to outperform other popular learning techniques for this task. However, concatenation of several features with different intrinsic properties and wide dynamical ranges may result in curse of dimensionality and redundancy issues. Among the factors impacting the effectiveness of classification are (i) feature subset selection, (ii) tuning of classifiers' parameters and (iii) selection of proper training instances. In this paper, we address these factors and propose a technique (denoted as GAoptSVM) for an optimal SVM-based video event classification through the use of an evolutionary optimization technique. Extensive experiments on the 50 h video data set of TRECVid 2008 event detection task and large quantities of video data collected from Youtube and CMU Graphics Lab Motion Capture Database demonstrate that our approach outperforms the traditional SVM and effectively classify video events with noticeable accuracy.
引用
收藏
页码:3784 / 3802
页数:19
相关论文
共 47 条
[1]  
Ahn H, 2006, LECT NOTES COMPUT SC, V4234, P420
[2]  
Amine A, 2008, LECT NOTES COMPUT SC, V5099, P321, DOI 10.1007/978-3-540-69905-7_37
[3]  
[Anonymous], P ACM INT C IM VID R
[4]  
[Anonymous], IAPR INT C MACH VIS
[5]  
[Anonymous], P 6 ACM INT C IM VID
[6]  
[Anonymous], TREC VIDEO RETRIEVAL
[7]  
[Anonymous], CMU GRAPH LAB MO CAP
[8]  
[Anonymous], P INT C COMP VIS PAT
[9]  
[Anonymous], IEEE T CIRCUITS SYST
[10]   Personalized abstraction of broadcasted American football video by highlight selection [J].
Babaguchi, N ;
Kawai, Y ;
Ogura, T ;
Kitahashi, T .
IEEE TRANSACTIONS ON MULTIMEDIA, 2004, 6 (04) :575-586