Action recognition method based on a novel keyframe extraction method and enhanced 3D convolutional neural network

Cited by: 1
Authors
Tian, Qiuhong [1]
Li, Saiwei [1]
Zhang, Yuankui [1]
Lu, Hongyi [1]
Pan, Hao [1]
Affiliation
[1] Zhejiang Sci Tech Univ, Hangzhou 310018, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Action recognition; 3D attention mechanism; Keyframe extraction; 3D residual structure;
DOI
10.1007/s13042-024-02235-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action recognition remains a challenging task in computer vision. Traditional action recognition methods cannot fully extract the spatiotemporal features of actions in video. To address this problem, we propose an action recognition method based on keyframe extraction and DAMR_3DNet (D3DNet + 3D Attention Mechanism module + 3D Residual module). First, we develop a keyframe extraction method based on image information entropy and the hog_ssim similarity algorithm, which selects keyframes from the input video to represent its content. The selected keyframes serve as the model input, which reduces the computational complexity of the network. Next, we design the DAMR_3DNet model to recognize actions while reducing the number of network parameters. The D3DNet module improves the C3D network by replacing 3D convolution with 3D decoupled convolution and by introducing a feature fusion layer. A 3D attention mechanism strengthens the action features and reduces the influence of background features. Finally, a 3D residual structure avoids vanishing gradients while fusing high-level and low-level spatiotemporal features. Experiments on the UCF101, Chinese Sign Language (CSL), and HMDB51 datasets show that the proposed method improves action recognition performance and outperforms most state-of-the-art methods.
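The entropy half of the keyframe extraction step can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how entropy and the hog_ssim similarity are combined, so the example below covers only image information entropy, and the top-k selection rule and the names `image_entropy` and `select_keyframes` are assumptions made for illustration. Frames are modeled as plain 2D lists of gray levels to keep the block dependency-free.

```python
import math
from collections import Counter


def image_entropy(frame):
    """Shannon entropy (in bits) of the gray-level histogram of a 2D frame."""
    pixels = [p for row in frame for p in row]
    total = len(pixels)
    counts = Counter(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def select_keyframes(frames, k):
    """Return indices of the k highest-entropy frames, in temporal order.

    Assumption: ranking by entropy alone. The paper also uses a hog_ssim
    similarity measure, which is not reproduced here.
    """
    ranked = sorted(
        range(len(frames)),
        key=lambda i: image_entropy(frames[i]),
        reverse=True,
    )
    return sorted(ranked[:k])


# Hypothetical 2x2 "frames": flat, four distinct levels, two distinct levels.
flat = [[7, 7], [7, 7]]
four_levels = [[0, 1], [2, 3]]
two_levels = [[0, 0], [255, 255]]
keyframes = select_keyframes([flat, four_levels, two_levels], k=2)
```

In this toy input the flat frame carries no information and is dropped, while the two more varied frames are kept. A fuller version would presumably also discard frames that are too similar to an already selected keyframe, which is the role the abstract assigns to the similarity measure.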
Pages: 475-491
Page count: 17