Video-Based Recognition of Human Activity Using Novel Feature Extraction Techniques

Cited by: 2

Authors:

Issa, Obada [1]

Shanableh, Tamer [1]

Affiliations:

[1] Amer Univ Sharjah, Dept Comp Sci & Engn, POB 26666, Sharjah, U Arab Emirates

Source:

APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 11

Keywords:

activity recognition; high-efficiency video coding; machine learning; motion vectors; NETWORK;

DOI:

10.3390/app13116856

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703 ;

Abstract:

This paper proposes a novel approach to activity recognition in which videos are compressed using video coding and feature vectors are generated from the resulting compression variables. We propose to eliminate the temporal dimension of the feature vectors by computing the mean and standard deviation of each variable across all video frames. Thus, each video is represented by a single feature vector of 67 variables. For the motion vectors, we eliminate the temporal dimension by projecting their phases using PCA, thus representing each video by a single feature vector whose length equals the number of frames in the video. Consequently, complex classifiers such as LSTM can be avoided and classical machine learning techniques can be used instead. Experiments on the JHMDB dataset yielded average classification accuracies of 68.8% and 74.2% when using the projected phases of motion vectors and the video coding feature variables, respectively. The advantage of the proposed solution is its use of low-dimensional feature vectors (FVs) and simple machine learning techniques.
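The two temporal reductions described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, array shapes, and random inputs are assumptions made for illustration, the record does not say which compression variables make up the 67-element vector, and projecting onto only the first principal component is one plausible reading of "projecting their phases using PCA".

```python
import numpy as np

def pool_over_time(frame_features):
    """Collapse a (num_frames, num_vars) array into one vector holding
    the per-variable mean followed by the per-variable standard deviation."""
    x = np.asarray(frame_features, dtype=float)
    return np.concatenate([x.mean(axis=0), x.std(axis=0)])

def project_phases(mv_x, mv_y):
    """Project per-frame motion-vector phases onto their first principal
    component, giving one value per frame (assumed interpretation)."""
    phases = np.arctan2(mv_y, mv_x)            # (num_frames, num_blocks)
    centered = phases - phases.mean(axis=0)    # center each block's phase over time
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[0]                    # (num_frames,)

# Hypothetical stand-in data: 40 frames, 5 coding variables, 16 blocks per frame.
rng = np.random.default_rng(0)
frame_features = rng.normal(size=(40, 5))
mv_x = rng.normal(size=(40, 16))
mv_y = rng.normal(size=(40, 16))

pooled = pool_over_time(frame_features)        # length 2 * num_vars
projected = project_phases(mv_x, mv_y)         # length num_frames
```

Either output is a single fixed-layout vector per video, which is what allows a classical classifier to be used in place of a sequence model.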

References

Pages: 11

28 entries in total

[1] Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition [J].

Ahsan, Unaiza ;

Madhok, Rishi ;

Essa, Irfan .

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :179-189

[2] Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation [J].

Bertasius, Gedas ;

Torresani, Lorenzo .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9736-9745

[3] Dynamic Image Networks for Action Recognition [J].

Bilen, Hakan ;

Fernando, Basura ;

Gavves, Efstratios ;

Vedaldi, Andrea ;

Gould, Stephen .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3034-3042

[4] Random forests [J].

Breiman, L .

MACHINE LEARNING, 2001, 45 (01) :5-32

[5]

Cherian A, 2017, Arxiv, DOI arXiv:1704.02112

[6]

Chéron G, 2015, Arxiv, DOI arXiv:1506.03607

[7] PoTion: Pose MoTion Representation for Action Recognition [J].

Choutas, Vasileios ;

Weinzaepfel, Philippe ;

Revaud, Jerome ;

Schmid, Cordelia .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7024-7033

[8] Large-scale weakly-supervised pre-training for video action recognition [J].

Ghadiyaram, Deepti ;

Du Tran ;

Mahajan, Dhruv .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12038-12047

[9]

Gkioxari G, 2014, Arxiv, DOI arXiv:1411.6031

[10]

Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
