An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition

被引:4
|
作者
Chen, Jun [1 ]
Xu, Yuanping [1 ]
Zhang, Chaolong [1 ,2 ]
Xu, Zhijie [2 ]
Meng, Xiangxiang [1 ]
Wang, Jie [1 ]
机构
[1] Chengdu Univ Informat Technol, Sch Software Engn, Chengdu, Peoples R China
[2] Univ Huddersfield, Sch Comp & Engn, Huddersfield, W Yorkshire, England
来源
2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC) | 2019年
关键词
Optical Flow; Human Action Recognition; Two-stream CNN; Three-dimensional CNN;
D O I
10.23919/iconac.2019.8894962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to obtain global contextual information precisely from videos with heavy camera motions and scene changes, this study proposes an improved spatiotemporal two-stream neural network architecture with a novel convolutional fusion layer. The three main improvements of this study are: 1) the Resnet-101 network has been integrated into the two streams of the target network independently; 2) two kinds of feature maps (i.e., the optical flow motion and RGB-channel information) obtained by the corresponding convolution layer of two streams respectively are superimposed on each other; 3) the temporal information is combined with the spatial information by the integrated three-dimensional (3D) convolutional neural network (CNN) to extract more latent information from the videos. The proposed approach was tested by using UCF-101 and HMDB51 benchmarking datasets and the experimental results show that the proposed two-stream 3D CNN model can gain substantial improvement on the recognition rate in video-based analysis.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
  • [31] Two-Stream Network with 3D Common-Specific Framework for RGB-D Action Recognition
    Qin, Xiaolei
    Ge, Yongxin
    Feng, Jinyuan
    Chen, Yida
    Zhan, Liuwei
    Wang, Xuchu
    Wang, Yuangan
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 731 - 738
  • [32] Kinematics Features for 3D Action Recognition Using Two-Stream CNN
    Wang, Jiangliu
    Liu, Yunhui
    2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2018, : 1731 - 1736
  • [33] Human Instance Segmentation Based on Two-Stream Convolutional Neural Network
    Ma Zitong
    Wang Guodong
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)
  • [34] Two-Stream Mixed Convolutional Neural Network for American Sign Language Recognition
    Ma, Ying
    Xu, Tianpei
    Kim, Kangchul
    SENSORS, 2022, 22 (16)
  • [35] 3D Convolutional Neural Networks for Human Action Recognition
    Ji, Shuiwang
    Xu, Wei
    Yang, Ming
    Yu, Kai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) : 221 - 231
  • [36] Two-Stream Convolutional Neural Network for Multimodal Matching
    Zhang, Youcai
    Gu, Yiwei
    Gu, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 14 - 21
  • [37] Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition
    Xia, Limin
    Fu, Weiye
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11611 - 11626
  • [38] A Novel Motion Recognition Method Based on Improved Two-stream Convolutional Neural Network and Sparse Feature Fusion
    Chen, Chen
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2022, 19 (03) : 1329 - 1348
  • [39] Two-stream Graph Attention Convolutional for Video Action Recognition
    Zhang, Deyuan
    Gao, Hongwei
    Dai, Hailong
    Shi, Xiangbin
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 23 - 27
  • [40] Deep Convolutional Neural Network Based on Two-Stream Convolutional Unit
    Hou Congcong
    He Yuqing
    Jiang Xiaoheng
    Pan Jing
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)