Feature Fusion for Human Action Recognition based on Classical Descriptors and 3D convolutional networks

被引:0
作者
Qin, Yang [1 ]
Mo, Lingfei [1 ]
Xie, Benyi [1 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing, Jiangsu, Peoples R China
来源
2017 ELEVENTH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST) | 2017年
关键词
Human Action Recognition; Feature Fusion; 3D Convolutional Network; Classical Descriptor;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a feature fusion method that combines different kinds of classical descriptors and multi-channel 3-dimensional convolutional neural networks for the Human Action Recognition(HAR). The interrelationship between the classical descriptors and the 3D convolutional filters is explored. The spatio-temporal features are learned by the 3D convolutional networks which is trained on a large scale labeled video dataset. The classical descriptors are used as auxiliary feature to fuse a fusion feature vector with the learned features from 3D CNN. Feeding this new fusion feature vector into the SVM classifier can improve the recognition accuracy. The verification experiments are finished on different datasets. The recognition rate of the KTH dataset is 95.1% and that of the UCF101 dataset is 86.6%. The experimental results prove that this feature fusion method performs efficient and robust on the human action recognition.
引用
收藏
页码:487 / 491
页数:5
相关论文
共 35 条
  • [1] Human Activity Analysis: A Review
    Aggarwal, J. K.
    Ryoo, M. S.
    [J]. ACM COMPUTING SURVEYS, 2011, 43 (03)
  • [2] Human motion analysis: A review
    Aggarwal, JK
    Cai, Q
    [J]. IEEE NONRIGID AND ARTICULATED MOTION WORKSHOP, PROCEEDINGS, 1997, : 90 - 102
  • [3] Annane D, 2014, ADV NEURAL INFORM PR, V1, P568
  • [4] [Anonymous], IEEE T PATTERN ANAL
  • [5] [Anonymous], 2005, PROC CVPR IEEE
  • [6] [Anonymous], 2016, 2016 IEEE INT ULTR S
  • [7] [Anonymous], 2016, OVERVIEW GRADIENT DE
  • [8] Behnke S, 2003, LECT NOTES COMPUTER, V392, P1345
  • [9] Bengio Y., 2007, LARGE SCALE KERNEL M, V34, P1, DOI DOI 10.7551/MITPRESS/7496.003.0016
  • [10] Learning Deep Architectures for AI
    Bengio, Yoshua
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127