A Framework for Personalized Human Activity Recognition

Cited by: 2
Authors
Eris, Hasan Ali [1 ]
Erturk, Mehmet Ali [2 ]
Aydin, Muhammed Ali [1 ]
Affiliations
[1] Istanbul Univ Cerrahpasa, Dept Comp Engn, Istanbul, Turkiye
[2] Istanbul Univ, Dept Comp Engn, TR-34452 Istanbul, Turkiye
Keywords
Human activity recognition (HAR); CNN; RNN; LSTM; personalized activity recognition;
DOI
10.1142/S0218001423560165
Chinese Library Classification
TP18 [Theory of Artificial Intelligence]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
In today's world, Human Activity Recognition (HAR) from video streams is used in many areas of daily life; automated surveillance systems and sports statistics, for example, are computed from video with the help of HAR. Activity detection is not a new subject, and several methods are available; however, the most recent and most promising techniques rely on Convolutional Neural Networks (CNNs). A CNN is primarily used on a single image frame to perform logical or categorical identification of an object, scene, or activity. We exploit this property to adapt CNNs to video streams and achieve HAR. In this study, we present a Personalized HAR (PHAR) framework that increases activity recognition accuracy with Object Detection (OD). First, we review state-of-the-art HAR and OD methods in the literature. Then we describe our framework with two new Single Person Human Activity Recognition models. Finally, the performance of the new framework is evaluated against well-known activity detection methods. Results show that our new PHAR model, with a 95% accuracy ratio, outperforms the CNN-LSTM-based reference model (90%). Moreover, a new metric, the Average Accuracy Score (AAS), is introduced in this study; the PHAR models achieve approximately 94% AAS, better than the reference model's 89% AAS.
Pages: 28
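The abstract describes the reference approach as per-frame CNN features aggregated over time by an LSTM. The sketch below is a minimal, illustrative PyTorch version of that general CNN-LSTM pattern; the class name CNNLSTMClassifier, the small convolutional backbone, the hidden size, and the clip shape are assumptions made for this example and do not reproduce the models published in the paper.

```python
# Minimal sketch of a CNN-LSTM video activity classifier.
# All layer sizes and names are hypothetical, not taken from the paper.
import torch
import torch.nn as nn

class CNNLSTMClassifier(nn.Module):
    def __init__(self, num_classes: int, hidden_size: int = 256):
        super().__init__()
        # Per-frame CNN feature extractor (small, illustrative backbone).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),   # -> (batch*frames, 64, 1, 1)
        )
        # LSTM aggregates the per-frame features over time.
        self.lstm = nn.LSTM(input_size=64, hidden_size=hidden_size,
                            batch_first=True)
        self.head = nn.Linear(hidden_size, num_classes)

    def forward(self, clips: torch.Tensor) -> torch.Tensor:
        # clips: (batch, frames, channels, height, width)
        b, t, c, h, w = clips.shape
        feats = self.cnn(clips.reshape(b * t, c, h, w)).reshape(b, t, -1)
        _, (h_n, _) = self.lstm(feats)   # last hidden state per clip
        return self.head(h_n[-1])        # activity logits

# Example: two 8-frame RGB clips at 112x112, 10 activity classes.
model = CNNLSTMClassifier(num_classes=10)
logits = model(torch.randn(2, 8, 3, 112, 112))  # -> shape (2, 10)
```

In the personalized setting the abstract describes, an object detector would presumably first localize the person of interest so that only the relevant frames or crops are passed to such a classifier; the exact coupling between OD and the activity models is specific to the PHAR framework itself.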