Silhouette Pose Feature-Based Human Action Classification Using Capsule Network

被引：2

作者：

Saif, A. F. M. Saifuddin ^{[1
]}

Khan, Md Akib Shahriar ^{[1
]}

Hadi, Abir Mohammad ^{[1
]}

Karmoker, Rahul Proshad ^{[1
]}

Gomes, Joy Julian ^{[1
]}

机构：

[1] Amer Int Univ, Dhaka, Bangladesh

来源：

JOURNAL OF INFORMATION TECHNOLOGY RESEARCH | 2021年 / 14卷 / 02期

关键词：

Action Classification; Artificial Intelligence; Computer Vision; Image Processing; Machine Learning; Pattern Recognition; ACTION RECOGNITION; IMAGE;

D O I：

10.4018/JITR.2021040106

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Recent years have seen a rise in the use of various machine learning techniques in computer vision, particularly in posing feature-based human action recognition which includes convolutional neural networks (CNN) and recurrent neural network (RNN). CNN-based methods are useful in recognizing human actions for combined motions (i.e., standing up, hand shaking, walking). However, in case of uncertainty of camera motion, occlusion, and multiple people, CNN suppresses important feature information and is not efficient enough to recognize variations for human action. Besides, RNN with long short-term memory (LSTM) requires more computational power to retain memories to classify human actions. This research proposes an extended framework based on capsule network using silhouette pose features to recognize human actions. Proposed extended framework achieved high accuracy of 95.64% which is higher than previous research methodology. Extensive experimental validation of the proposed extended framework reveals efficiency which is expected to contribute significantly in action recognition research.

引用

页码：106 / 124

页数：19

共 48 条

[1]

Afshar P, 2018, IEEE IMAGE PROC, P3129, DOI 10.1109/ICIP.2018.8451379

[2]

Angelini F., 2018, ARXIV181012126

[3]

Anisuzzaman D., 2018, INT J IMAGE GRAPHICS, V10

[4]

Anisuzzaman D., 2018, INT J COMPUTERS APPL, V182, P35, DOI [10.5120/ijca2018917855, DOI 10.5120/IJCA2018917855]

[5]

[Anonymous], 2018, ARXIV180404241

[6]

[Anonymous], 2013, ARXIV201313013557

[7]

[Anonymous], 2012, ARXIV PREPRINT ARXIV

[8]

Baccouche Moez, 2011, Human Behavior Unterstanding. Proceedings Second International Workshop, HBU 2011, P29, DOI 10.1007/978-3-642-25446-8_4

[9] Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition [J].

Bagautdinov, Timur ;

Alahi, Alexandre ;

Fleuret, Francois ;

Fua, Pascal ;

Savarese, Silvio .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3425-3434

[10] Action Recognition by Time Series of Retinotopic Appearance and Motion Features [J].

Barrett, Daniel Paul ;

Siskind, Jeffrey Mark .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (12) :2250-2263

← 1 2 3 4 5 →