Action recognition method based on a novel keyframe extraction method and enhanced 3D convolutional neural network

被引：1

作者：

Tian, Qiuhong ^{[1
]}

Li, Saiwei ^{[1
]}

Zhang, Yuankui ^{[1
]}

Lu, Hongyi ^{[1
]}

Pan, Hao ^{[1
]}

机构：

[1] Zhejiang Sci Tech Univ, Hangzhou 310018, Zhejiang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2025年 / 16卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Action recognition; 3D attention mechanism; Keyframe extraction; 3D residual structure;

D O I：

10.1007/s13042-024-02235-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

At present, action recognition is a challenging task in the field of computer vision. Traditional action recognition methods cannot fully extract the spatiotemporal features of actions in video. To address the problem, an action recognition method based on keyframe extraction and DAMR_3DNet (D3DNet+3D Attention Mechanism module+3D Residual module) is proposed. Firstly, we explore a keyframe extraction method based on image information entropy and hog_ssim similarity algorithm, which selects keyframes from the input video to represent video content. And we take the selected keyframes as the model input to reduce the computational complexity of network model. Afterward, we design a DAMR_3DNet model to recognize action and reduce the parameters of network. The D3DNet module improves the C3D network by using the 3D decoupled convolution substituting the 3D convolution and introducing a feature fusion layer. And a 3D attention mechanism is designed to strengthen the action features and reduce the influence of background features. Finally, a 3D residual structure is explored to avoid gradient disappearance while fusing the high-level and low-level spatiotemporal features. Experiments consistently show the superiority of the proposed method on UCF101, Chinese sign language (CSL) and HMDB51 datasets. And the results demonstrate that the proposed method is effective, which improves the performance of action recognition and outperforms the most state-of-the-art methods.

引用

页码：475 / 491

页数：17

共 50 条

[1] 3D Face Recognition Method Based on Deep Convolutional Neural Network
Feng, Jianying
Guo, Qian
Guan, Yudong
Wu, Mengdie
Zhang, Xingrui
Ti, Chunli
SMART INNOVATIONS IN COMMUNICATION AND COMPUTATIONAL SCIENCES, VOL 2, 2019, 670 : 123 - 130
[2] Behavior recognition method based on improved 3D convolutional neural network
Zhang X.
Li C.
Sun L.
Zhang M.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (08): : 2000 - 2006
[3] 3D Convolutional Neural Network for Action Recognition
Zhang, Junhui
Chen, Li
Tian, Jing
COMPUTER VISION, PT I, 2017, 771 : 600 - 607
[4] Human Action Recognition with 3D Convolutional Neural Network
Lima, Tiago
Fernandes, Bruno
Barros, Pablo
2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
[5] Action recognition method based on lightweight network and rough-fine keyframe extraction
Pan, Hao
Tian, Qiuhong
Li, Saiwei
Miao, Weilun
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
[6] Enhanced 3D Action Recognition Based on Deep Neural Network
Park, Sungjoo
Kim, Dongchil
2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 470 - 472
[7] Keys for Action: An Efficient Keyframe-Based Approach for 3D Action Recognition Using a Deep Neural Network
Yasin, Hashim
Hussain, Mazhar
Weber, Andreas
SENSORS, 2020, 20 (08)
[8] An optimal 3D convolutional neural network based lipreading method
He, Lun
Ding, Biyun
Wang, Hao
Zhang, Tao
IET IMAGE PROCESSING, 2022, 16 (01) : 113 - 122
[9] ENHANCED ACTION RECOGNITION WITH VISUAL ATTRIBUTE-AUGMENTED 3D CONVOLUTIONAL NEURAL NETWORK
Wang, Yunfeng
Zhou, Wengang
Zhang, Qilin
Li, Houqiang
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
[10] An improved memristor-based 3D Convolutional Neural Network for action recognition
Wang, Yining
Li, Ke
Shen, Siyuan
Duan, Shukai
Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707

← 1 2 3 4 5 →