Event Recognition based on 3D Convolutional Networks

被引:0
|
作者
Chen, Rong [1 ]
Yu, Yuanlong [1 ]
Huang, ZhiYong [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
来源
2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER) | 2018年
基金
中国国家自然科学基金;
关键词
Deep learning; event recognition; convolution; 3D; spatiotemporal information;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Videos have become widespread due to the ease of obtaining and going share via social platform. Event recognition in video has gained more and more attention in computer vision. This is a hard task that requires extracting meaningful spatiotemporal features for event recognition, mainly due to complexity and diversity of video events. Many proposed networks learn spatial features and temporal separately. In this paper, we propose a simple, yet effective approach for spatio-temporal features' learning: using deep spatial-temporal neural networks based on convolution 3D. The architecture is shown in Fig.1. The network can capture the motion information in multiple adjacent frames and appearance information simultaneously. Most of the famous 2D CNN networks follow a regular pattern: the former of convolution kernel size is bigger and the number of channel in latter layers increase, such as alexnet. So we choose the way that contacting two continuous convolutional layers to instead of a convolutional layer which its kernel size is bigger through synthetical consideration. We carry out experiments on KIM dataset, and evaluate them using 5-fold method. And this paper introduce two simple method of increasing the amount of training data and improving the performance on both. Experimental result shows that our model achieve an accuracy of 95.33% on KTH dataset, we further demonstrate that our model is a general and effective architecture through compared to other algorithms, including hand-crafted algorithms and other CNNs.
引用
收藏
页码:45 / 50
页数:6
相关论文
共 50 条
  • [21] RECOGNITION OF 3D SURFACE FRACTAL DIMENSION BASED ON CONVOLUTIONAL NEURAL NETWORK
    Wang, Liuqun
    Lei, Sheng
    Wang, Zijie
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2024,
  • [22] Trajectory-Based 3D Convolutional Descriptors for Human Action Recognition
    Arif, Sheeraz
    Wang, Jing
    Hussain, Fida
    Fei, Zesong
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (04) : 851 - 870
  • [23] 3D InspectionNet: A Deep 3D Convolutional Neural Networks Based Approach for 3D Defect Detection of Concrete Columns
    Dizaji, Mehrdad S.
    Harris, Devin K.
    NONDESTRUCTIVE CHARACTERIZATION AND MONITORING OF ADVANCED MATERIALS, AEROSPACE, CIVIL INFRASTRUCTURE, AND TRANSPORTATION XIII, 2019, 10971
  • [24] Automated Training of Convolutional Networks by Virtual 3D Models for Parts Recognition in Assembly Process
    Zidek, Kamil
    Lazorik, Peter
    Piter, Jan
    Pavlenko, Ivan
    Hosovsky, Alexander
    ADVANCES IN MANUFACTURING II, VOL 1 - SOLUTIONS FOR INDUSTRY 4.0, 2019, : 287 - 297
  • [25] Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition
    Mercanoglu Sincan, Ozge
    Keles, Hacer Yalim
    IEEE ACCESS, 2022, 10 : 18608 - 18618
  • [26] Action Recognition in Videos with Spatio-Temporal Fusion 3D Convolutional Neural Networks
    Wang, Y.
    Shen, X. J.
    Chen, H. P.
    Sun, J. X.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (03) : 580 - 587
  • [27] KNOWLEDGE BASED 3D BUILDING MODEL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS FROM LIDAR AND AERIAL IMAGERIES
    Alidoost, F.
    Arefi, H.
    XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3): : 833 - 840
  • [28] 3D Reconstruction From Monocular Images Based on Deep Convolutional Networks
    Ren, Yinhui
    Wang, Zhihui
    Fan, Daoerji
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 248 - 252
  • [29] TOWARDS 3D CONVOLUTIONAL NEURAL NETWORKS WITH MESHES
    Dominguez, Miguel
    Such, Felipe Petroski
    Sah, Shagan
    Ptucha, Raymond
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3929 - 3933
  • [30] Working activity recognition approach based on 3D deep convolutional neural network
    Liu T.
    Lu Z.
    Sun Y.
    Liu F.
    He B.
    Zhong J.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (08): : 2143 - 2156