Human Activity Recognition Based on Deep-Temporal Learning Using Convolution Neural Networks Features and Bidirectional Gated Recurrent Unit With Features Selection

被引：32

作者：

Ahmad, Tariq ^{[1
]}

Wu, Jinsong ^{[2
,3
]}

Alwageed, Hathal Salamah ^{[4
]}

Khan, Faheem ^{[5
]}

Khan, Jawad ^{[6
]}

Lee, Youngmoon ^{[6
]}

机构：

[1] Guilin Univ Elect Technol, Sch Informat & Commun Engn, Guilin 541004, Peoples R China

[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Guilin 510004, Peoples R China

[3] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile

[4] Jouf Univ, Coll Comp & Informat Sci, Sakakah 72314, Saudi Arabia

[5] Gachon Univ, Dept Comp Engn, Seongnam 13120, South Korea

[6] Hanyang Univ, Dept Robot, Ansan 15588, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Feature extraction; Visualization; Computational modeling; Three-dimensional displays; Data mining; Logic gates; Face recognition; Human activity recognition; recurrent neural networks (RNNs); convolution neural networks (CNNs); bidirectional-gated recurrent unit (Bi-GRU); deep learning;

D O I：

10.1109/ACCESS.2023.3263155

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recurrent Neural Networks (RNNs) and their variants have been demonstrated tremendous successes in modeling sequential data such as audio processing, video processing, time series analysis, and text mining. Inspired by these facts, we propose human activity recognition technique to proceed visual data via utilizing convolution neural network (CNN) and Bidirectional-gated recurrent unit (Bi-GRU). Firstly, we extract deep features from frames sequence of human activities videos using CNN and then select most important features from the deep appearances to improve performance and decrease computational complexity of the model. Secondly, to learn temporal motions of frames sequence, we design Bi-GRU and feed those deep-important features extracted from frames sequence of human activities to Bi-GRU which learn temporal dynamics in forward and backward direction at each time step. We conduct extensive experiments on realistic videos of human activity recognition datasets YouTube11, HMDB51 and UCF101. Lastly, we compare the obtained results with existing methods to show the competence of our proposed technique.

引用

页码：33148 / 33159

页数：12

共 54 条

[1] SDIGRU: Spatial and Deep Features Integration Using Multilayer Gated Recurrent Unit for Human Activity Recognition [J].

Ahmad, Tariq ;

Wu, Jinsong .

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) :973-985

[2]

Ahmad T, 2021, INT J ADV COMPUT SC, V12, P18

[3]

An S., 2022, arXiv

[4] Applying Machine Learning to Stem Cell Culture and Differentiation [J].

Ashraf, Mishal ;

Khalilitousi, Mohammadali ;

Laksman, Zachary .

CURRENT PROTOCOLS, 2021, 1 (09)

[5]

Ballas N., 2015, DELVING DEEPER CONVO

[6] The recognition of human movement using temporal templates [J].

Bobick, AF ;

Davis, JW .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267

[7]

Bregonzio M, 2009, PROC CVPR IEEE, P1948, DOI 10.1109/CVPRW.2009.5206779

[8]

Chollet F, 2017, Arxiv, DOI arXiv:1610.02357

[9]

Couturier R., 2021, arXiv

[10]

Deng J, 2009, 2009 IEEE C COMP VIS, P248

← 1 2 3 4 5 6 →