A modified vector of locally aggregated descriptors approach for fast video classification

Cited by: 21

Authors:

Mironica, Ionut [1]

Duta, Ionut Cosmin [2]

Ionescu, Bogdan [1]

Sebe, Nicu [3]

Affiliations:

[1] Univ Politehn Bucuresti, LAPI, Bucharest 061071, Romania

[2] Univ Trento, MHUG Grp, Comp Sci, Trento, Italy

[3] Univ Trento, Trento, Italy

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2016 / Volume 75 / Issue 15

Keywords:

Capturing content variation in time in video; Modified vector of locally aggregated descriptor; Random forests; Video classification;

DOI:

10.1007/s11042-015-2819-7

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

To reduce computational complexity, most video classification approaches represent video data at the frame level. In this paper we investigate a novel perspective that combines frame features to create a global descriptor. The main contributions are: (i) a fast algorithm to densely extract global frame features, which are easier and faster to compute than spatio-temporal local features; (ii) replacing the traditional k-means visual vocabulary from Bag-of-Words with a Random Forest approach, allowing a significant speedup; (iii) the use of a modified Vector of Locally Aggregated Descriptors (VLAD) combined with a Fisher kernel approach that replaces the classic Bag-of-Words approach, allowing us to achieve high accuracy. By doing so, the proposed approach combines the frame-based features, effectively capturing video content variation in time. We show that our framework is highly general and does not depend on a particular type of descriptor. Experiments performed on four different scenarios (movie genre classification, human action recognition, daily activity recognition and violence scene classification) show the superiority of the proposed approach compared to the state of the art.
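For orientation, the aggregation idea behind a VLAD-style video descriptor can be sketched as follows. This is a minimal illustration of the classic VLAD encoding only, not the modified variant proposed in the paper: the vocabulary is passed in as a plain list of centroids (the paper replaces k-means with a Random Forest), no Fisher kernel component is included, and the function name `vlad_encode`, its parameters, and the toy data are hypothetical.

```python
import math


def vlad_encode(frame_features, centroids):
    """Classic VLAD sketch: one global vector from per-frame descriptors.

    frame_features: list of d-dimensional frame descriptors (lists of floats).
    centroids: list of k d-dimensional vocabulary centers (assumed given).
    """
    k, d = len(centroids), len(centroids[0])
    residual_sums = [[0.0] * d for _ in range(k)]

    for x in frame_features:
        # Hard-assign the frame to its nearest centroid (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(x, centroids[i])),
        )
        # Accumulate the residual between the frame descriptor and that centroid.
        for j in range(d):
            residual_sums[nearest][j] += x[j] - centroids[nearest][j]

    # Concatenate the k residual sums into one vector of length k * d.
    flat = [v for row in residual_sums for v in row]
    # Signed square-root normalization, then L2 normalization.
    flat = [math.copysign(math.sqrt(abs(v)), v) for v in flat]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


# Toy example: three 2-D "frame" descriptors and a two-word vocabulary.
video = [[0.0, 0.0], [4.0, 0.0], [10.0, 11.0]]
vocabulary = [[2.0, 0.0], [10.0, 10.0]]
descriptor = vlad_encode(video, vocabulary)
print(len(descriptor))  # 4 (k * d)
```

The output length depends only on the vocabulary size and descriptor dimension, not on the number of frames, which is what lets videos of different lengths be compared with a single fixed-size vector.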

Citation

Pages: 9045-9072

Page count: 28

References: 66 in total

