Event Recognition based on 3D Convolutional Networks

被引:0
|
作者
Chen, Rong [1 ]
Yu, Yuanlong [1 ]
Huang, ZhiYong [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
来源
2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER) | 2018年
基金
中国国家自然科学基金;
关键词
Deep learning; event recognition; convolution; 3D; spatiotemporal information;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Videos have become widespread due to the ease of obtaining and going share via social platform. Event recognition in video has gained more and more attention in computer vision. This is a hard task that requires extracting meaningful spatiotemporal features for event recognition, mainly due to complexity and diversity of video events. Many proposed networks learn spatial features and temporal separately. In this paper, we propose a simple, yet effective approach for spatio-temporal features' learning: using deep spatial-temporal neural networks based on convolution 3D. The architecture is shown in Fig.1. The network can capture the motion information in multiple adjacent frames and appearance information simultaneously. Most of the famous 2D CNN networks follow a regular pattern: the former of convolution kernel size is bigger and the number of channel in latter layers increase, such as alexnet. So we choose the way that contacting two continuous convolutional layers to instead of a convolutional layer which its kernel size is bigger through synthetical consideration. We carry out experiments on KIM dataset, and evaluate them using 5-fold method. And this paper introduce two simple method of increasing the amount of training data and improving the performance on both. Experimental result shows that our model achieve an accuracy of 95.33% on KTH dataset, we further demonstrate that our model is a general and effective architecture through compared to other algorithms, including hand-crafted algorithms and other CNNs.
引用
收藏
页码:45 / 50
页数:6
相关论文
共 50 条
  • [31] Dynamic Hand Gesture Recognition Based on 3D Convolutional Neural Network Models
    Zhang, Wenjin
    Wang, Jiacun
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 224 - 229
  • [32] 3D convolutional neural network for object recognition: a review
    Rahul Dev Singh
    Ajay Mittal
    Rajesh K. Bhatia
    Multimedia Tools and Applications, 2019, 78 : 15951 - 15995
  • [33] 3D convolutional neural network for object recognition: a review
    Singh, Rahul Dev
    Mittal, Ajay
    Bhatia, Rajesh K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 15951 - 15995
  • [34] Abnormal behavior recognition based on edge feature and 3D convolutional neural network
    Bian, Chunlei
    Xu, Yiming
    Wang, Li
    Gu, Haifeng
    Zhou, Fangjie
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 1 - 6
  • [35] A Performance Evaluation of Classic Convolutional Neural Networks for 2D and 3D Palmprint and Palm Vein Recognition
    Jia, Wei
    Gao, Jian
    Xia, Wei
    Zhao, Yang
    Min, Hai
    Lu, Jing-Ting
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (01) : 18 - 44
  • [36] 3D Dynamic Hand Gestures Recognition Using the Leap Motion Sensor and Convolutional Neural Networks
    Lupinetti, Katia
    Ranieri, Andrea
    Giannini, Franca
    Monti, Marina
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, AVR 2020, PT I, 2020, 12242 : 420 - 439
  • [37] A study of the effect of noise and occlusion on the accuracy of convolutional neural networks applied to 3D object recognition
    Garcia-Garcia, Alberto
    Garcia-Rodriguez, Jose
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Gomez-Donoso, Francisco
    Cazorla, Miguel
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 164 : 124 - 134
  • [38] A Performance Evaluation of Classic Convolutional Neural Networks for 2D and 3D Palmprint and Palm Vein Recognition
    Wei Jia
    Jian Gao
    Wei Xia
    Yang Zhao
    Hai Min
    Jing-Ting Lu
    International Journal of Automation and Computing, 2021, 18 : 18 - 44
  • [39] 3D object retrieval based on multi-view convolutional neural networks
    Li, Xi-Xi
    Cao, Qun
    Wei, Sha
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (19) : 20111 - 20124
  • [40] 3D object retrieval based on multi-view convolutional neural networks
    Xi-Xi Li
    Qun Cao
    Sha Wei
    Multimedia Tools and Applications, 2017, 76 : 20111 - 20124