Event Recognition based on 3D Convolutional Networks

被引:0
|
作者
Chen, Rong [1 ]
Yu, Yuanlong [1 ]
Huang, ZhiYong [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
来源
2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER) | 2018年
基金
中国国家自然科学基金;
关键词
Deep learning; event recognition; convolution; 3D; spatiotemporal information;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Videos have become widespread due to the ease of obtaining and going share via social platform. Event recognition in video has gained more and more attention in computer vision. This is a hard task that requires extracting meaningful spatiotemporal features for event recognition, mainly due to complexity and diversity of video events. Many proposed networks learn spatial features and temporal separately. In this paper, we propose a simple, yet effective approach for spatio-temporal features' learning: using deep spatial-temporal neural networks based on convolution 3D. The architecture is shown in Fig.1. The network can capture the motion information in multiple adjacent frames and appearance information simultaneously. Most of the famous 2D CNN networks follow a regular pattern: the former of convolution kernel size is bigger and the number of channel in latter layers increase, such as alexnet. So we choose the way that contacting two continuous convolutional layers to instead of a convolutional layer which its kernel size is bigger through synthetical consideration. We carry out experiments on KIM dataset, and evaluate them using 5-fold method. And this paper introduce two simple method of increasing the amount of training data and improving the performance on both. Experimental result shows that our model achieve an accuracy of 95.33% on KTH dataset, we further demonstrate that our model is a general and effective architecture through compared to other algorithms, including hand-crafted algorithms and other CNNs.
引用
收藏
页码:45 / 50
页数:6
相关论文
共 50 条
  • [1] 3D Convolutional Neural Networks for Human Action Recognition
    Ji, Shuiwang
    Xu, Wei
    Yang, Ming
    Yu, Kai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) : 221 - 231
  • [2] A facial expression recognition method based on ensemble of 3D convolutional neural networks
    Sun, Wenyun
    Zhao, Haitao
    Jin, Zhong
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07) : 2795 - 2812
  • [3] EEG-Based Emotion Recognition using 3D Convolutional Neural Networks
    Salama, Elham S.
    El-Khoribi, Reda A.
    Shoman, Mahmoud E.
    Shalaby, Mohamed A. Wahby
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (08) : 329 - 337
  • [4] A facial expression recognition method based on ensemble of 3D convolutional neural networks
    Wenyun Sun
    Haitao Zhao
    Zhong Jin
    Neural Computing and Applications, 2019, 31 : 2795 - 2812
  • [5] 3D Convolutional Neural Networks for Dynamic Sign Language Recognition
    Liang, Zhi-Jie
    Liao, Sheng-Bin
    Hu, Bing-Zhang
    COMPUTER JOURNAL, 2018, 61 (11) : 1724 - 1736
  • [6] 3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks
    Wang, Keze
    Wang, Xiaolong
    Lin, Liang
    Wang, Meng
    Zuo, Wangmeng
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 97 - 106
  • [7] 3D Convolutional Neural Networks for Soccer Object Motion Recognition
    Lee, Jiwon
    Kim, Yoonhyung
    Jeong, Minki
    Kim, Changick
    Nam, Do-Won
    Lee, JungSoo
    Moon, Sungwon
    Yoo, WonYoung
    2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, : 354 - 358
  • [8] SIGN LANGUAGE RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Jie
    Zhou, Wengang
    Li, Houqiang
    Li, Weiping
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [9] An efficient attention module for 3d convolutional neural networks in action recognition
    Jiang, Guanghao
    Jiang, Xiaoyan
    Fang, Zhijun
    Chen, Shanshan
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7043 - 7057
  • [10] An efficient attention module for 3d convolutional neural networks in action recognition
    Guanghao Jiang
    Xiaoyan Jiang
    Zhijun Fang
    Shanshan Chen
    Applied Intelligence, 2021, 51 : 7043 - 7057