A strong benchmark for yoga action recognition based on lightweight pose estimation model

Authors
Zhou, Liangtai [1]
Zhang, Weiwei [1]
Zhang, Banghui [1]
Li, Xiaobin [1]
Zhu, Jianqing [1]
Affiliation
[1] Huaqiao Univ, Coll Engn, Chenghua North Rd, Quanzhou 362021, Fujian, Peoples R China
Keywords
Pose estimation; Action recognition; Knowledge distillation; 3D-CNN
DOI
10.1007/s00530-024-01646-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Yoga action recognition is crucial for precise motion analysis and effective training guidance, which in turn support physical health and skill development. However, current methods struggle to maintain high accuracy and real-time performance when dealing with complex poses and occlusions. These methods also neglect the dynamic characteristics and temporal sequence information inherent in yoga actions. This paper therefore proposes a two-stage action recognition method tailored for yoga scenarios. The method first applies pose estimation based on knowledge distillation to improve the accuracy and efficiency of lightweight models in detecting complex poses and occlusions. A lightweight 3D convolutional neural network (3D-CNN) is then used for action recognition. The two stages are connected through heat maps, which improves recognition accuracy and captures spatiotemporal features in video sequences. Experimental results indicate that on the COCO dataset, the DistillPose-m model achieves a 2.5% improvement in Average Precision (AP) compared to RTMPose-m. In the yoga action recognition task, our model exhibits an improvement of approximately 2% over traditional Graph Convolutional Network (GCN) methods on both the Deepyoga and 3Dyoga90 datasets. This study improves the accuracy of pose estimation in yoga scenarios, addressing the challenges of bodily occlusions and complex postures. By fully leveraging the spatiotemporal information inherent in yoga movements, it improves the accuracy of yoga action recognition. This research provides insights and support for motion training and analysis systems in other dynamic activities, such as martial arts and dance.
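The abstract says the two stages are connected through heat maps but does not give the exact format. As a rough illustration only, the sketch below assumes each estimated keypoint is rendered as a 2D Gaussian heat map and that the per-frame maps are stacked over time into a keypoints x frames x height x width volume, which a 3D-CNN could then consume. The function names, the Gaussian rendering, the sigma value, and the axis order are all assumptions made for this example, not details taken from the paper.

```python
import math

def keypoint_heatmap(x, y, height, width, sigma=1.0):
    """Return a height x width Gaussian heat map peaked at pixel (x, y)."""
    return [
        [math.exp(-((col - x) ** 2 + (row - y) ** 2) / (2.0 * sigma ** 2))
         for col in range(width)]
        for row in range(height)
    ]

def heatmap_volume(frames, height, width, sigma=1.0):
    """Stack per-frame keypoint heat maps into a [K][T][H][W] nested list.

    frames: list of T frames, each a list of K (x, y) keypoint positions.
    The K-first axis order is an assumption made for this sketch.
    """
    num_keypoints = len(frames[0])
    return [
        [keypoint_heatmap(*frame[k], height, width, sigma) for frame in frames]
        for k in range(num_keypoints)
    ]

# Two frames with two hypothetical keypoints each, on an 8 x 8 grid.
frames = [[(2, 3), (5, 5)], [(3, 3), (5, 6)]]
volume = heatmap_volume(frames, height=8, width=8)
```

In this layout the keypoint axis plays the role of input channels and the frame axis is the temporal depth, so a 3D convolution slides jointly over time and image space.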
Pages: 20