A View Independent Classification Framework for Yoga Postures

被引:7
作者
Chasmai M. [1 ]
Das N. [2 ]
Bhardwaj A. [3 ]
Garg R. [1 ]
机构
[1] Computer Science and Engineering, Indian Institute of Technology Delhi, Delhi
[2] Electrical Engineering, Indian Institute of Technology Delhi, Delhi
[3] School of Information Technology, Indian Institute of Technology Delhi, Delhi
关键词
Machine learning; Pose estimation; Transfer learning; Yogasana;
D O I
10.1007/s42979-022-01376-7
中图分类号
学科分类号
摘要
Yoga is a globally acclaimed and widely recommended practice for a healthy living. Maintaining correct posture while performing a Yogasana is of utmost importance. In this work, we employ transfer learning from human pose estimation models for extracting 136 key-points spread all over the body to train a random forest classifier which is used for estimation of the Yogasanas. The results are evaluated on an in-house collected extensive yoga video database of 51 subjects recorded from four different camera angles. We use a three step scheme for evaluating the generalizability of a Yoga classifier by testing it on (1) unseen frames, (2) unseen subjects, and (3) unseen camera angles. We argue that for most of the applications, validation accuracies on unseen subjects and unseen camera angles would be most important. We empirically analyze over three public datasets, the advantage of transfer learning and the possibilities of target leakage. We further demonstrate that the classification accuracies critically depend on the cross validation method employed and can often be misleading. To promote further research, we have made key-points dataset and code publicly available. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 57 条
[1]  
Andriluka M., Pishchulin L., Gehler P., Schiele B., 2d human pose estimation: New benchmark and state of the art analysis, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3686-3693, (2014)
[2]  
Li J., Can Z., Hao M., Yihuan F., Hao-Shucewu L., CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10855-10864, (2019)
[3]  
Weiyao L., Huabin L., Shuzhanli L., Rui Y., Tao W., Ning X., Hongkaiqi X., Nicu G.-J., Human in Events: A Large-Scale Benchmark for Human-Centric Video Analysis in Complex Events, (2020)
[4]  
Andriluka M., Iqbal U., Insafutdinov E., Pishchulin L., Milan A., Gall J., Schiele B., PoseTrack: A Benchmark for Human Pose Estimation and Tracking, IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5167-5176, (2018)
[5]  
Ma N., Xiangyu Z., Hai-Taojian S., Shufflenet v2: Practical guidelines for efficient cnn architecture design, European Conference on Computer Vision (ECCV), pp. 116-131, (2018)
[6]  
Vision-based human activity recognition: A survey, Multimedia Tools and Applications 79, 41, pp. 30509-30555, (2020)
[7]  
Breiman L., Random forests, Mach Learn, 45, 1, pp. 5-32, (2001)
[8]  
Arndt B., Andreaskhalsa Sat Bir M., Shirleysherman Karen S., Effects of yoga on mental and physical health: A short summary of reviews, Evidence-Based Complementary and Alternative Medicine, 2012, (2012)
[9]  
Chen H.-T., He Y.-Z., Hsu C.-C., Computer-assisted yoga training system, Multimedia Tools and Applications, 77, 18, pp. 23969-23991, (2018)
[10]  
Hua-Tsunghe C., Chun-Chieh Y.-Z., Chien-Li C., Suh-Yinlin Bao-Shuh L., Yoga posture recognition for self-training, International Conference on Multimedia Modeling, pp. 496-505, (2014)