Recovering 3D human pose from monocular images

Cited by: 434
Authors
Agarwal, A. [1]
Triggs, B. [1]
Affiliation
[1] INRIA Rhone Alpes, F-38330 Montbonnot St Martin, France
Keywords
computer vision; human motion estimation; machine learning; multivariate regression
DOI
10.1109/TPAMI.2006.21
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We describe a learning-based method for recovering 3D human body pose from single images and monocular image sequences. Our approach requires neither an explicit body model nor prior labeling of body parts in the image. Instead, it recovers pose by direct nonlinear regression against shape descriptor vectors extracted automatically from image silhouettes. For robustness against local silhouette segmentation errors, silhouette shape is encoded by histogram-of-shape-contexts descriptors. We evaluate several different regression methods: ridge regression, Relevance Vector Machine (RVM) regression, and Support Vector Machine (SVM) regression over both linear and kernel bases. The RVMs provide much sparser regressors without compromising performance, and kernel bases give a small but worthwhile improvement in performance. The loss of depth and limb labeling information often makes the recovery of 3D pose from single silhouettes ambiguous. To handle this, the method is embedded in a novel regressive tracking framework, using dynamics from the previous state estimate together with a learned regression value to disambiguate the pose. We show that the resulting system tracks long sequences stably. For realism and good generalization over a wide range of viewpoints, we train the regressors on images resynthesized from real human motion capture data. The method is demonstrated for several representations of full body pose, both quantitatively on independent but similar test data and qualitatively on real image sequences. Mean angular errors of 4-6 degrees are obtained for a variety of walking motions.
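As an illustration of the kind of regression the abstract describes, the following is a minimal sketch of ridge regression over a Gaussian kernel basis, mapping descriptor vectors to pose vectors. It is not the authors' implementation: the function names, the descriptor and pose dimensions, the kernel width `beta`, the regularizer `lam`, and the synthetic data are all assumptions chosen for the example, and the silhouette descriptor extraction itself is not shown.

```python
import numpy as np

def gaussian_kernel(A, B, beta):
    """Kernel matrix K[i, j] = exp(-beta * ||A[i] - B[j]||^2)."""
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-beta * np.maximum(sq, 0.0))  # clamp tiny negative round-off

def fit_kernel_ridge(Z, X, beta, lam):
    """Solve (K + lam * I) W = X for the kernel weights W.

    Z: (n, d) training descriptors, X: (n, m) training pose vectors.
    """
    K = gaussian_kernel(Z, Z, beta)
    return np.linalg.solve(K + lam * np.eye(len(Z)), X)

def predict_pose(Z_train, W, Z_new, beta):
    """Predict pose vectors for new descriptors from the kernel expansion."""
    return gaussian_kernel(Z_new, Z_train, beta) @ W

# Synthetic stand-in data (hypothetical sizes, not taken from the paper).
rng = np.random.default_rng(0)
n, d, m = 40, 20, 3
Z_train = rng.normal(size=(n, d))                       # placeholder descriptors
X_train = np.tanh(Z_train @ rng.normal(size=(d, m)) * 0.2)  # placeholder poses

beta = 1.0 / (2.0 * d)  # assumed kernel width, scaled to typical squared distances
lam = 1e-8              # tiny here so the fit nearly interpolates; tune in practice

W = fit_kernel_ridge(Z_train, X_train, beta, lam)
pred_train = predict_pose(Z_train, W, Z_train, beta)
pred_new = predict_pose(Z_train, W, rng.normal(size=(5, d)), beta)
```

A sparse method such as the RVM mentioned in the abstract would keep only a subset of the training points as kernel centers; this sketch keeps all of them, which is the plain ridge case.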
Pages: 44-58
Page count: 15
Related papers
50 in total
  • [1] Recovering 3D Human Mesh From Monocular Images: A Survey
    Tian, Yating
    Zhang, Hongwen
    Liu, Yebin
    Wang, Limin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15406 - 15425
  • [2] Multi-Person 3D Human Pose Estimation from Monocular Images
    Dabral, Rishabh
    Gundavarapu, Nitesh B.
    Mitra, Rahul
    Sharma, Abhishek
    Ramakrishnan, Ganesh
    Jain, Arjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 405 - 414
  • [3] End-to-End Algorithm for Recovering Human 3D Model from Monocular Images
    Liu, Yu
    Shi, Taichu
    Xu, Lexi
    Nie, Jingwen
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1082 - 1087
  • [4] Learning Monocular 3D Human Pose Estimation from Multi-view Images
    Rhodin, Helge
    Sporri, Jorg
    Katircioglu, Isinsu
    Constantin, Victor
    Meyer, Frederic
    Mueller, Erich
    Salzmann, Mathieu
    Fua, Pascal
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8437 - 8446
  • [5] 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network
    Li, Sijin
    Chan, Antoni B.
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 332 - 347
  • [6] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [8] Context-based Appearance Descriptor for 3D Human Pose estimation from Monocular Images
    Sedai, S.
    Bennamoun, M.
    Huynh, D.
    2009 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2009), 2009, : 484 - 491
  • [9] A survey on monocular 3D human pose estimation
    Ji X.
    Fang Q.
    Dong J.
    Shuai Q.
    Jiang W.
    Zhou X.
    Virtual Reality and Intelligent Hardware, 2020, 2 (06) : 471 - 500
  • [10] MONOCULAR 3D HUMAN POSE ESTIMATION BY CLASSIFICATION
    Greif, Thomas
    Lienhart, Rainer
    Sengupta, Debabrata
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,