Efficient feature extraction, encoding and classification for action recognition

被引:152
作者
Kantorov, Vadim [1 ]
Laptev, Ivan [1 ]
机构
[1] Ecole Normale Super, INRIA, WILLOW, Dept Comp Sci, Paris, France
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
D O I
10.1109/CVPR.2014.332
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Local video features provide state-of-the-art performance for action recognition. While the accuracy of action recognition has been continuously improved over the recent years, the low speed of feature extraction and subsequent recognition prevents current methods from scaling up to real-size problems. We address this issue and first develop highly efficient video features using motion information in video compression. We next explore feature encoding by Fisher vectors and demonstrate accurate action recognition using fast linear classifiers. Our method improves the speed of video feature extraction, feature encoding and action classification by two orders of magnitude at the cost of minor reduction in recognition accuracy. We validate our approach and compare it to the state of the art on four recent action recognition datasets.
引用
收藏
页码:2593 / 2600
页数:8
相关论文
共 40 条
[1]  
[Anonymous], 2012, PAMI
[2]  
[Anonymous], 2009, CVPR
[3]  
[Anonymous], 2012, CVPR
[4]  
[Anonymous], 2011, BMVC
[5]  
[Anonymous], 2011, CVPR
[6]  
[Anonymous], 2009, WORKSH VID OR OBJ EV
[7]  
[Anonymous], 2013, IJCV
[8]  
[Anonymous], 2007, ICCV
[9]  
[Anonymous], 2010, ECCV
[10]  
[Anonymous], 2008, BMVC 2008 19 BRIT MA