A modified vector of locally aggregated descriptors approach for fast video classification

Cited by: 21
Authors
Mironica, Ionut [1]
Duta, Ionut Cosmin [2]
Ionescu, Bogdan [1]
Sebe, Nicu [3]
Affiliations
[1] Univ Politehn Bucuresti, LAPI, Bucharest 061071, Romania
[2] Univ Trento, MHUG Grp, Comp Sci, Trento, Italy
[3] Univ Trento, Trento, Italy
Keywords
Capturing content variation in time in video; Modified vector of locally aggregated descriptors; Random forests; Video classification
DOI
10.1007/s11042-015-2819-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In order to reduce computational complexity, most video classification approaches represent video data at the frame level. In this paper we investigate a novel perspective that combines frame features to create a global descriptor. The main contributions are: (i) a fast algorithm to densely extract global frame features, which are easier and faster to compute than spatio-temporal local features; (ii) replacing the traditional k-means visual vocabulary from Bag-of-Words with a Random Forest approach, allowing a significant speedup; (iii) the use of a modified Vector of Locally Aggregated Descriptors (VLAD) combined with a Fisher kernel approach that replaces the classic Bag-of-Words approach, allowing us to achieve high accuracy. By doing so, the proposed approach combines the frame-based features, effectively capturing video content variation in time. We show that our framework is highly general and is not dependent on a particular type of descriptor. Experiments performed on four different scenarios (movie genre classification, human action recognition, daily activity recognition and violence scene classification) show the superiority of the proposed approach compared to the state of the art.
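As a rough illustration of the aggregation idea in contribution (iii), the following is a minimal sketch of standard VLAD encoding over per-frame feature vectors. It is not the authors' modified method: the record gives no details of the modification, the Random Forest vocabulary, or the Fisher kernel combination, so this sketch assumes a fixed set of codebook centers and plain nearest-center assignment. The names `vlad_encode`, `example_frames`, and `example_centers` are hypothetical.

```python
import math

def vlad_encode(frames, centers):
    """Aggregate per-frame feature vectors into one L2-normalized VLAD vector.

    Assumption: hard nearest-center assignment stands in for whatever
    vocabulary the paper actually uses.
    """
    dim = len(centers[0])
    residuals = [[0.0] * dim for _ in centers]
    for x in frames:
        # Assign the frame to its nearest center (squared Euclidean distance).
        k = min(range(len(centers)),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(x, centers[j])))
        # Accumulate the residual between the frame feature and that center.
        for d in range(dim):
            residuals[k][d] += x[d] - centers[k][d]
    # Concatenate the per-center residual sums and L2-normalize.
    flat = [v for row in residuals for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat

# Toy data: three 2-D frame features and a two-center codebook.
example_frames = [[1.0, 0.0], [0.0, 1.0], [10.0, 10.0]]
example_centers = [[0.0, 0.0], [10.0, 9.0]]
video_vector = vlad_encode(example_frames, example_centers)
```

The resulting vector has length (number of centers) x (feature dimension), so a whole video is summarized by one fixed-size descriptor regardless of how many frames it contains.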
Pages: 9045-9072
Number of pages: 28