PoseAnalyser: A Survey on Human Pose Estimation

被引:0
作者
Kulkarni S. [1 ]
Deshmukh S. [1 ]
Fernandes F. [1 ]
Patil A. [2 ]
Jabade V. [1 ]
机构
[1] Vishwakarma Institute of Technology, Pune
[2] Ifm Engineering Private Limited, Pune
关键词
Computer vision; Convolutional neural networks; Human pose estimation; Machine learning; MediaPipe; OpenPose;
D O I
10.1007/s42979-022-01567-2
中图分类号
学科分类号
摘要
Human pose estimation is the process of detecting the body keypoints of a person and can be used to classify different poses. Many researchers have proposed various ways to get a perfect 2D as well as a 3D human pose estimator that could be applied for various types of applications. This paper is a review of all the state-of-the-art architectures based on human pose estimation, the papers referred were based on the types of computer vision and machine learning algorithms, such as feed-forward neural networks, convolutional neural networks (CNN), OpenPose, MediaPipe, and many more. These different approaches are compared on various parameters, like the type of dataset used, the evaluation metric, etc. Different human pose datasets, such as COCO and MPII activity datasets with keypoints, as well as specific application-based datasets, are reviewed in this survey paper. Researchers may use these architectures and datasets in a range of domains, which are also discussed. The paper analyzes several approaches and architectures that can be used as a guide for other researchers to assist them in developing better techniques to achieve high accuracy. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 49 条
[41]  
Rogez G., Weinzaepfel P., Schmid C., LCR-net++: multi-person 2D and 3D pose detection in natural images, IEEE Trans Pattern Anal Mach Intell, (2019)
[42]  
Holte M.B., Tran C., Trivedi M.M., Moeslund T.B., Human pose estimation and activity recognition from multi-view videos: comparative explorations of recent developments, IEEE J Sel Top Signal Process, 6, pp. 538-552, (2012)
[43]  
Chaikaew A., Somkuan K., Yuyen T., Thai sign language recognition: an application of deep neural network, 2021 joint international conference on digital arts, media and technology with ECTI northern section conference on electrical, electronics, computer and telecommunication engineering, (2021)
[44]  
Chen Y., Du R., Luo K., Xiao Y., Fall detection system based on real-time pose estimation and SVM, 2021 IEEE 2nd international conference on big data, artificial intelligence and internet of things engineering (ICBAIE), (2021)
[45]  
Belagiannis V., Zisserman A., Recurrent human pose estimation, 2017 12th IEEE international conference on automatic face & gesture recognition (FG 2017), (2017)
[46]  
Lin T.-Y., Maire M., Belongie S., Hays J., Perona P., Ramanan D., Dollar P., Zitnick C.L., Microsoft Coco: Common objects in context, Computer vision—ECCV 2014. Pp., pp. 740-755, (2014)
[47]  
Andriluka M., Pishchulin L., Gehler P., Schiele B., MPII human pose dataset, 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2014)
[48]  
Sapp B., Taskar B., MODEC: multimodal decomposable models for human pose estimation, 2013 IEEE conference on computer vision and pattern recognition, (2013)
[49]  
Johnson S., Everingham M., Clustered pose and nonlinear appearance models for human pose estimation, Proceedings of the British Machine Vision Conference, 2010, (2010)