Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network

被引:6
|
作者
Xiong, Xin [1 ,2 ,3 ]
Min, Weidong [2 ,3 ,4 ]
Han, Qing [4 ]
Wang, Qi [5 ]
Zha, Cheng [4 ]
机构
[1] Nanchang Univ, Affiliated Hosp 1, Informat Dept, Nanchang 330006, Peoples R China
[2] Nanchang Univ, Inst Metaverse, Nanchang 330031, Peoples R China
[3] Jiangxi Key Lab Smart City, Nanchang 330047, Peoples R China
[4] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Peoples R China
[5] Nanchang Univ, Sch Software, Nanchang 330047, Peoples R China
基金
中国国家自然科学基金;
关键词
SPATIAL-TEMPORAL ATTENTION; CONVOLUTIONAL NETWORKS; VIDEO; LSTM;
D O I
10.1155/2022/6608448
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition
    Chen, Jun
    Xu, Yuanping
    Zhang, Chaolong
    Xu, Zhijie
    Meng, Xiangxiang
    Wang, Jie
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 135 - 140
  • [2] Improving human action recognition with two-stream 3D convolutional neural network
    Van-Minh Khong
    Thanh-Hai Tran
    2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018,
  • [3] Two-Stream 3D Convolution Attentional Network for Action Recognition
    Kusumoseniarto, Raden Hadapiningsyah
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,
  • [4] 3D Convolutional Two-Stream Network for Action Recognition in Videos
    Li, Min
    Qi, Yuezhu
    Yang, Jian
    Zhang, Yanfang
    Ren, Junxing
    Du, Hong
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1697 - 1701
  • [5] Kinematics Features for 3D Action Recognition Using Two-Stream CNN
    Wang, Jiangliu
    Liu, Yunhui
    2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2018, : 1731 - 1736
  • [6] Two-Stream Convolutional Neural Network for Video Action Recognition
    Qiao, Han
    Liu, Shuang
    Xu, Qingzhen
    Liu, Shouqiang
    Yang, Wanggan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (10): : 3668 - 3684
  • [7] Two-Stream RNN/CNN for Action Recognition in 3D Videos
    Zhao, Rui
    Ali, Haider
    van der Smagt, Patrick
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4260 - 4267
  • [8] Two-Stream Convolution Neural Network with Video-stream for Action Recognition
    Dai, Wei
    Chen, Yimin
    Huang, Chen
    Gao, Ming-Ke
    Zhang, Xinyu
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Transferable two-stream convolutional neural network for human action recognition
    Xiong, Qianqian
    Zhang, Jianjing
    Wang, Peng
    Liu, Dongdong
    Gao, Robert X.
    JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 : 605 - 614
  • [10] Two-Stream Network with 3D Common-Specific Framework for RGB-D Action Recognition
    Qin, Xiaolei
    Ge, Yongxin
    Feng, Jinyuan
    Chen, Yida
    Zhan, Liuwei
    Wang, Xuchu
    Wang, Yuangan
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 731 - 738