An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition

被引:4
|
作者
Chen, Jun [1 ]
Xu, Yuanping [1 ]
Zhang, Chaolong [1 ,2 ]
Xu, Zhijie [2 ]
Meng, Xiangxiang [1 ]
Wang, Jie [1 ]
机构
[1] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu, Peoples R China
[2] Univ Huddersfield, Sch Comp & Engn, Huddersfield, W Yorkshire, England
来源
2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC) | 2019年
关键词
Optical Flow; Human Action Recognition; Two-stream CNN; Three-dimensional CNN;
D O I
10.23919/iconac.2019.8894962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to obtain global contextual information precisely from videos with heavy camera motions and scene changes, this study proposes an improved spatiotemporal two-stream neural network architecture with a novel convolutional fusion layer. The three main improvements of this study are: 1) the Resnet-101 network has been integrated into the two streams of the target network independently; 2) two kinds of feature maps (i.e., the optical flow motion and RGB-channel information) obtained by the corresponding convolution layer of two streams respectively are superimposed on each other; 3) the temporal information is combined with the spatial information by the integrated three-dimensional (3D) convolutional neural network (CNN) to extract more latent information from the videos. The proposed approach was tested by using UCF-101 and HMDB51 benchmarking datasets and the experimental results show that the proposed two-stream 3D CNN model can gain substantial improvement on the recognition rate in video-based analysis.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
  • [41] Driver Distraction Recognition with Pose-aware Two-stream Convolutional Neural Network
    Tao, Chenghao
    Ma, Sheqiang
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12790
  • [42] GeometryMotion-Net: A Strong Two-Stream Baseline for 3D Action Recognition
    Liu, Jiaheng
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4711 - 4721
  • [43] Two-Stream Convolutional Neural Networks for Emergency Recognition in Images
    Chen, Jia
    Duan, Shihui
    Long, Fei
    Wang, Yongxing
    Wang, Song
    Ling, Qiang
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6470 - 6474
  • [44] Abnormal Behavior Detection in Uncrowded Videos with Two-Stream 3D Convolutional Neural Networks
    Mehmood, Abid
    APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [45] A Spatiotemporal Heterogeneous Two-Stream Network for Action Recognition
    Chen, Enqing
    Bai, Xue
    Gao, Lei
    Tinega, Haron Chweya
    Ding, Yingqiang
    IEEE ACCESS, 2019, 7 : 57267 - 57275
  • [46] Two-Stream Network Based on Visual Saliency Sharing for 3D Model Recognition
    Nie, Weizhi
    Qu, Lu
    Ren, Minjie
    Liang, Qi
    Su, Yuting
    Li, Yangyang
    Jin, Hao
    IEEE ACCESS, 2020, 8 : 5979 - 5989
  • [47] Action Recognition by 3D Convolutional Network
    Brezovsky, Matus
    Sopiak, Dominik
    Oravec, Milos
    PROCEEDINGS OF ELMAR-2018: 60TH INTERNATIONAL SYMPOSIUM ELMAR-2018, 2018, : 71 - 74
  • [48] A Multimode Two-Stream Network for Egocentric Action Recognition
    Li, Ying
    Shen, Jie
    Xiong, Xin
    He, Wei
    Li, Peng
    Yan, Wenjie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 357 - 368
  • [49] Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Jin-Gong Jia
    Yuan-Feng Zhou
    Xing-Wei Hao
    Feng Li
    Christian Desrosiers
    Cai-Ming Zhang
    Journal of Computer Science and Technology, 2020, 35 : 538 - 550
  • [50] Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Jia, Jin-Gong
    Zhou, Yuan-Feng
    Hao, Xing-Wei
    Li, Feng
    Desrosiers, Christian
    Zhang, Cai-Ming
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (03) : 538 - 550