EHPE: Skeleton Cues-Based Gaussian Coordinate Encoding for Efficient Human Pose Estimation

被引:103
作者
Liu, Hai [1 ]
Liu, Tingting [2 ,3 ]
Chen, Yu [1 ]
Zhang, Zhaoli [1 ]
Li, You-Fu [3 ,4 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China
[2] Hubei Univ, Sch Educ, Wuhan 430062, Hubei, Peoples R China
[3] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Heating systems; Encoding; Biological system modeling; Task analysis; Pose estimation; Feature extraction; Skeleton; Deep learning; gaussian coordinate encoding; human pose estimation; regularization; skeleton direction;
D O I
10.1109/TMM.2022.3197364
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human pose estimation (HPE) has many wide applications such as multimedia processing, behavior understanding and human-computer interaction. Most previous studies have encountered many constraints, such as restricted scenarios and RGB inputs. To mitigate constraints to estimating the human poses in general scenarios, we present an efficient human pose estimation model (i.e., EHPE) with joint direction cues and Gaussian coordinate encoding. Specifically, we propose an anisotropic Gaussian coordinate coding method to describe the skeleton direction cues among adjacent keypoints. To the best of our knowledge, this is the first time that the skeleton direction cues is introduced to the heatmap encoding in HPE task. Then, a multi-loss function is proposed to constrain the output to prevent the overfitting. The Kullback-Leibler divergence is introduced to measure the predication label and its ground truth one. The performance of EHPE is evaluated on two HPE datasets: MS COCO and MPII. Experimental results demonstrate that EHPE can obtain robust results, and it significantly outperforms existing state-of-the-art HPE methods. Lastly, we extend the experiments on infrared images captured by our research group. The experiments achieved the impressive results regardless of insufficient color and texture information.
引用
收藏
页码:8464 / 8475
页数:12
相关论文
共 43 条
[1]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]   UniPose: Unified Human Pose Estimation in Single Images and Videos [J].
Artacho, Bruno ;
Savakis, Andreas .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :7033-7042
[3]   On the Analysis of Human Posture for Detecting Social Interactions with Wearable Devices [J].
Baronti, Paolo ;
Girolami, Michele ;
Mavilia, Fabio ;
Palumbo, Filipp ;
Luisetto, Giancarlo .
PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, :340-345
[4]   Face-from-Depth for Head Pose Estimation on Depth Images [J].
Borghi, Guido ;
Fabbri, Matteo ;
Vezzani, Roberto ;
Calderara, Simone ;
Cucchiara, Rita .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (03) :596-609
[5]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[6]   Human Pose Estimation with Iterative Error Feedback [J].
Carreira, Joao ;
Agrawal, Pulkit ;
Fragkiadaki, Katerina ;
Malik, Jitendra .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4733-4742
[7]   Gaze Estimation via the Joint Modeling of Multiple Cues [J].
Chen, Wenhe ;
Xu, Hui ;
Zhu, Chao ;
Liu, Xiaoli ;
Lu, Yinghua ;
Zheng, Caixia ;
Kong, Jun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) :1390-1402
[8]   Cascaded Pyramid Network for Multi-Person Pose Estimation [J].
Chen, Yilun ;
Wang, Zhicheng ;
Peng, Yuxiang ;
Zhang, Zhiqiang ;
Yu, Gang ;
Sun, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112
[9]   Multi-Context Attention for Human Pose Estimation [J].
Chu, Xiao ;
Yang, Wei ;
Ouyang, Wanli ;
Ma, Cheng ;
Yuille, Alan L. ;
Wang, Xiaogang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5669-5678
[10]   Articulated and Generalized Gaussian Kernel Correlation for Human Pose Estimation [J].
Ding, Meng ;
Fan, Guoliang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) :776-789