Human Activity Recognition Based on Deep-Temporal Learning Using Convolution Neural Networks Features and Bidirectional Gated Recurrent Unit With Features Selection

被引：32

作者：

Ahmad, Tariq ^{[1
]}

Wu, Jinsong ^{[2
,3
]}

Alwageed, Hathal Salamah ^{[4
]}

Khan, Faheem ^{[5
]}

Khan, Jawad ^{[6
]}

Lee, Youngmoon ^{[6
]}

机构：

[1] Guilin Univ Elect Technol, Sch Informat & Commun Engn, Guilin 541004, Peoples R China

[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Guilin 510004, Peoples R China

[3] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile

[4] Jouf Univ, Coll Comp & Informat Sci, Sakakah 72314, Saudi Arabia

[5] Gachon Univ, Dept Comp Engn, Seongnam 13120, South Korea

[6] Hanyang Univ, Dept Robot, Ansan 15588, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Feature extraction; Visualization; Computational modeling; Three-dimensional displays; Data mining; Logic gates; Face recognition; Human activity recognition; recurrent neural networks (RNNs); convolution neural networks (CNNs); bidirectional-gated recurrent unit (Bi-GRU); deep learning;

D O I：

10.1109/ACCESS.2023.3263155

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recurrent Neural Networks (RNNs) and their variants have been demonstrated tremendous successes in modeling sequential data such as audio processing, video processing, time series analysis, and text mining. Inspired by these facts, we propose human activity recognition technique to proceed visual data via utilizing convolution neural network (CNN) and Bidirectional-gated recurrent unit (Bi-GRU). Firstly, we extract deep features from frames sequence of human activities videos using CNN and then select most important features from the deep appearances to improve performance and decrease computational complexity of the model. Secondly, to learn temporal motions of frames sequence, we design Bi-GRU and feed those deep-important features extracted from frames sequence of human activities to Bi-GRU which learn temporal dynamics in forward and backward direction at each time step. We conduct extensive experiments on realistic videos of human activity recognition datasets YouTube11, HMDB51 and UCF101. Lastly, we compare the obtained results with existing methods to show the competence of our proposed technique.

引用

页码：33148 / 33159

页数：12

共 54 条

[11] Skeleton-Based Multifeatures and Multistream Network for Real-Time Action Recognition [J].

Deng, Zhiwen ;

Gao, Qing ;

Ju, Zhaojie ;

Yu, Xiang .

IEEE SENSORS JOURNAL, 2023, 23 (07) :7397-7409

[12]

Donahue J, 2016, Arxiv, DOI arXiv:1411.4389

[13] Learning Spatiotemporal Features with 3D Convolutional Networks [J].

Du Tran ;

Bourdev, Lubomir ;

Fergus, Rob ;

Torresani, Lorenzo ;

Paluri, Manohar .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497

[14] Two Stream LSTM : A Deep Fusion Framework for Human Action Recognition [J].

Gammulle, Harshala ;

Denman, Simon ;

Sridharan, Sridha ;

Fookes, Clinton .

2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :177-186

[15]

Heng Wang, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3169, DOI 10.1109/CVPR.2011.5995407

[16] Action Detection in Complex Scenes with Spatial and Temporal Ambiguities [J].

Hu, Yuxiao ;

Cao, Liangliang ;

Lv, Fengjun ;

Yan, Shuicheng ;

Gong, Yihong ;

Huang, Thomas S. .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :128-135

[17]

Huang G, 2018, Arxiv, DOI [arXiv:1608.06993, DOI 10.48550/ARXIV.1608.06993, 10.1109/CVPR.2017.243, DOI 10.1109/CVPR.2017.243]

[18] 3D Convolutional Neural Networks for Human Action Recognition [J].

Ji, Shuiwang ;

Xu, Wei ;

Yang, Ming ;

Yu, Kai .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :221-231

[19] Large-scale Video Classification with Convolutional Neural Networks [J].

Karpathy, Andrej ;

Toderici, George ;

Shetty, Sanketh ;

Leung, Thomas ;

Sukthankar, Rahul ;

Fei-Fei, Li .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1725-1732

[20] Effect of Feature Selection on the Accuracy of Music Popularity Classification Using Machine Learning Algorithms [J].

Khan, Faheem ;

Tarimer, Ilhan ;

Alwageed, Hathal Salamah ;

Karadag, Buse Cennet ;

Fayaz, Muhammad ;

Abdusalomov, Akmalbek Bobomirzaevich ;

Cho, Young-Im .

ELECTRONICS, 2022, 11 (21)

← 1 2 3 4 5 6 →