Action recognition by key trajectories

被引：0

作者：

Fernando Camarena

Leonardo Chang

Miguel Gonzalez-Mendoza

Ricardo J Cuevas-Ascencio

机构：

[1] Tecnológico de Monterrey,School of Engineering and Science

[2] Campus Estado de México,undefined

来源：

Pattern Analysis and Applications | 2022年 / 25卷

关键词：

Action recognition; Pose estimation; Dense trajectories; Key trajectories;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Human action recognition is an active field of research that intends to explain what a subject is doing in an input video. Deep learning architectures serve as the foundation for cutting-edge approaches. Recent research, on the other hand, indicates that hand-crafted characteristics are complementary and, when combined, can enhance classification accuracy. Cutting-edge approaches are based on deep learning architectures. Recent research, however, indicates that hand-crafted features complement each other and can help boost classification accuracy when combined. We introduce the key trajectories approach that is based on the popular, hand-crafted method, improved dense trajectories. Our work explores how pose estimation can be used to find meaningful key points to reduce computational time, undesired noise, and to guarantee a stable frame processing rate. Furthermore, we tested how feature-tracking behaves with dense inverse search and with a frame to frame subject key point estimation. Our proposal was tested on the KTH and UCF11 datasets employing Bag-of-words and on the UCF50 and HMDB datasets using Fisher Vector, where we got an accuracy performance of 95.71, 84.88, 92.9, and 81.3%, respectively. Also, our proposal can recognize subject actions in video eight times faster compared to its dense counterpart. To maximize the bag-of-words classification performance, we illustrate how the hyperparameters affect both accuracy and computation time. Precisely, we present an exploration of the vocabulary size, the SVM hyperparameter, the descriptor’s distinctiveness, and the subject body key points.

引用

页码：409 / 423

页数：14

共 26 条

[1]

Bay H(2008)Speeded-up robust features (surf) Comput Vis Image Underst 110 346-359

[2]

Ess A(2004)Hidden markov models Lect Notes August 15 48-21

[3]

Tuytelaars T(2018)Bagging-randomminer: a one-class classifier for file access-based masquerade detection Mach Vis Appl 60 4-123

[4]

Van Gool L(2017)Going deeper into action recognition: a survey Image Vis Comput 64 107-110

[5]

Blunsom P(2005)On space-time interest points Int J Comput Vis 60 91-981

[6]

Camiña JB(2004)Distinctive image features from scale-invariant keypoints Int J Comput Vis 24 971-1520

[7]

Medina-Pérez MA(2013)Recognizing 50 human action categories of web videos Mach Vis Appl 19 1510-79

[8]

Monroy R(2017)Sequential deep trajectory descriptor for action recognition with three-stream CNN IEEE Trans Multimed 103 60-undefined

[9]

Loyola-González O(2013)Dense trajectories and motion boundary descriptors for action recognition Int J Comput Vis undefined undefined-undefined

[10]

Villanueva LAP(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 →