Uniting holistic and part-based attitudes for accurate and robust deep human pose estimation

被引:7
作者
Shamsafar, Faranak [1 ,2 ]
Ebrahimnezhad, Hossein [1 ]
机构
[1] Sahand Univ Technol, Fac Elect Engn, Comp Vis Res Lab, Tabriz, Iran
[2] Univ Tubingen, WSI Inst Comp Sci, Tubingen, Germany
关键词
Human pose estimation; Holistic prediction; Part-based prediction; Deep learning; Convolutional neural network;
D O I
10.1007/s12652-020-02347-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been utilized in many intelligent systems, including computer vision techniques. Human pose estimation is one of the popular tasks in computer vision that has benefited from modern feature learning strategies. In this regard, recent advances propose part-based approaches since pose estimation based on parts can produce more accurate results than when the human shape is considered holistically as one unbreakable, but deformable object. However, in real-word scenarios, problems like occlusion and cluttered background make difficulties in part-based methods. In this paper, we propose to unite the two attitudes of the part-based and the holistic pose predictions to make more accurate and more robust estimations. These two schemes are modeled using convolutional neural networks as regression and classification tasks in order, and are combined in three frameworks: multitasking, series, and parallel. Each of these settings has its own advantages, and the experimental results on the LSP test set demonstrate that it is essential to observe subjects, both based on parts and holistically in order to achieve more accurate and more robust estimation of human pose in challenging scenarios.
引用
收藏
页码:2339 / 2353
页数:15
相关论文
共 52 条
[1]   Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[3]  
[Anonymous], 2008, 2008 IEEE C COMP VIS
[4]  
[Anonymous], 2016, P 2016 BRIT MACH VIS
[5]  
[Anonymous], 2010, 2010 IEEE Computer society conference on computer vision and pattern recognition, DOI DOI 10.1109/CVPR.2010.5539906
[6]  
[Anonymous], ABS151203385 CORR
[7]  
[Anonymous], 2022, Science
[8]   Recurrent Human Pose Estimation [J].
Belagiannis, Vasileios ;
Zisserman, Andrew .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :468-475
[9]   Robust Optimization for Deep Regression [J].
Belagiannis, Vasileios ;
Rupprecht, Christian ;
Carneiro, Gustavo ;
Navab, Nassir .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2830-2838
[10]   Human Pose Estimation with Iterative Error Feedback [J].
Carreira, Joao ;
Agrawal, Pulkit ;
Fragkiadaki, Katerina ;
Malik, Jitendra .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4733-4742