A strong benchmark for yoga action recognition based on lightweight pose estimation model

Citations: 0

Authors:

Zhou, Liangtai [1]

Zhang, Weiwei [1]

Zhang, Banghui [1]

Li, Xiaobin [1]

Zhu, Jianqing [1]

Affiliations:

[1] Huaqiao Univ, Coll Engn, Chenghua North Rd, Quanzhou 362021, Fujian, Peoples R China

Source:

MULTIMEDIA SYSTEMS | 2025 / Vol. 31 / Issue 01

Keywords:

Pose estimation; Action recognition; Knowledge distillation; 3D-CNN

DOI:

10.1007/s00530-024-01646-9

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Yoga action recognition is crucial for enabling precise motion analysis and providing effective training guidance, which in turn facilitates the optimization of physical health and skill enhancement. However, current methods struggle to maintain high accuracy and real-time performance when dealing with complex poses and occlusions. Additionally, these methods neglect the dynamic characteristics and temporal sequence information inherent in yoga actions. Therefore, this paper proposes a two-stage action recognition method tailored for yoga scenarios. The method first employs pose estimation based on knowledge distillation to improve the accuracy and efficiency of lightweight models in detecting complex poses and occlusions. Subsequently, a lightweight 3D convolutional neural network (3D-CNN) is used for action recognition. The two stages are integrated through heat maps, which enhances recognition accuracy and captures spatiotemporal features in video sequences. Experimental results indicate that on the COCO dataset, the DistillPose-m model achieves a 2.5% improvement in Average Precision (AP) compared to RTMPose-m. In the yoga action recognition task, our model exhibits an improvement of approximately 2% over traditional Graph Convolutional Network (GCN) methods on both the Deepyoga and 3Dyoga90 datasets. This study enhances the performance and accuracy of pose estimation in yoga scenarios, addressing the challenges of bodily occlusions and complex postures. By fully leveraging the spatiotemporal information inherent in yoga movements, it improves the accuracy of yoga action recognition. This research provides critical insights and support for motion training and analysis systems in other dynamic activities, such as martial arts and dance.
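The abstract states that heat maps connect the pose estimation stage to the 3D-CNN stage but does not say how they are built. The following is a minimal sketch of one common way to do this, under stated assumptions: each estimated keypoint is rendered as a 2D Gaussian scaled by its confidence, and the per-frame maps are stacked along time into a volume that a 3D-CNN could consume. The function name `keypoints_to_heatmap_volume`, the `sigma` parameter, and the (K, T, H, W) layout are hypothetical choices for illustration and are not taken from the paper.

```python
import numpy as np


def keypoints_to_heatmap_volume(keypoints, scores, height, width, sigma=2.0):
    """Render per-frame keypoints as Gaussian heat maps stacked over time.

    keypoints: array of shape (T, K, 2) holding (x, y) pixel coordinates.
    scores:    array of shape (T, K) holding keypoint confidences.
    Returns a float32 array of shape (K, T, height, width).
    This layout and the Gaussian rendering are assumptions, not the paper's method.
    """
    keypoints = np.asarray(keypoints, dtype=np.float32)
    scores = np.asarray(scores, dtype=np.float32)

    # Pixel coordinate grids, shaped for broadcasting against (T, K, 1, 1).
    ys = np.arange(height, dtype=np.float32)[:, None]  # (H, 1)
    xs = np.arange(width, dtype=np.float32)[None, :]   # (1, W)

    x0 = keypoints[..., 0][..., None, None]  # (T, K, 1, 1)
    y0 = keypoints[..., 1][..., None, None]  # (T, K, 1, 1)

    # Squared distance of every pixel to every keypoint: (T, K, H, W).
    dist_sq = (xs - x0) ** 2 + (ys - y0) ** 2

    # Gaussian bump per keypoint, weighted by its confidence.
    heatmaps = np.exp(-dist_sq / (2.0 * sigma ** 2)) * scores[..., None, None]

    # Put the keypoint axis first so it can act as the channel axis of a 3D-CNN.
    return heatmaps.transpose(1, 0, 2, 3)


if __name__ == "__main__":
    # Toy input: 8 frames, 17 keypoints (the COCO keypoint count), 64x48 maps.
    rng = np.random.default_rng(0)
    kp = rng.uniform(low=0, high=48, size=(8, 17, 2))
    conf = np.ones((8, 17))
    volume = keypoints_to_heatmap_volume(kp, conf, height=64, width=48)
    print(volume.shape)
```

Placing the keypoint axis first lets it serve as the channel dimension of a 3D convolution, with time, height, and width as the three convolved axes. A real pipeline might instead reuse the heat maps output by the pose network directly.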


Pages: 20
