Deep Kinematic Pose Regression

被引:147
|
作者
Zhou, Xingyi [1 ]
Sun, Xiao [2 ]
Zhang, Wei [1 ]
Liang, Shuang [3 ]
Wei, Yichen [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
关键词
Kinematic model; Human pose estimation; Deep learning;
D O I
10.1007/978-3-319-49409-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning articulated object pose is inherently difficult because the pose is high dimensional but has many structural constraints. Most existing work do not model such constraints and does not guarantee the geometric validity of their pose estimation, therefore requiring a post-processing to recover the correct geometry if desired, which is cumbersome and sub-optimal. In this work, we propose to directly embed a kinematic object model into the deep neutral network learning for general articulated object pose estimation. The kinematic function is defined on the appropriately parameterized object motion variables. It is differentiable and can be used in the gradient descent based optimization in network training. The prior knowledge on the object geometric model is fully exploited and the structure is guaranteed to be valid. We show convincing experiment results on a toy example and the 3D human pose estimation problem. For the latter we achieve state-of-the-art result on Human3.6M dataset.
引用
收藏
页码:186 / 201
页数:16
相关论文
共 50 条
  • [1] Satellite Pose Estimation with Deep Landmark Regression and Nonlinear Pose Refinement
    Chen, Bo
    Cao, Jiewei
    Parra, Alvaro
    Chin, Tat-Jun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2816 - 2824
  • [2] DEEP CAMERA POSE REGRESSION USING MOTION VECTORS
    Guo, Fei
    He, Yifeng
    Guan, Ling
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 4073 - 4077
  • [3] MIXTURE OF DEEP REGRESSION NETWORKS FOR HEAD POSE ESTIMATION
    Huang, Yangguang
    Pan, Lili
    Zheng, Yali
    Xie, Mei
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 4093 - 4097
  • [4] Geometric loss functions for camera pose regression with deep learning
    Kendall, Alex
    Cipolla, Roberto
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6555 - 6564
  • [5] A Kinematic Bottleneck Approach for Pose Regression of Flexible Surgical Instruments Directly From Images
    Sestini, Luca
    Rosa, Benoit
    De Momi, Elena
    Ferrigno, Giancarlo
    Padoy, Nicolas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2938 - 2945
  • [6] Pose Invariant 3D Facial Landmark Detection Via Pose Normalization and Deep Regression
    Zhang, Jingchen
    Gao, Kangkang
    Zhao, Qijun
    Wang, Daning
    PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 74 - 78
  • [7] Deep learning-based plane pose regression in obstetric ultrasound
    Di Vece, Chiara
    Dromey, Brian
    Vasconcelos, Francisco
    David, Anna L.
    Peebles, Donald
    Stoyanov, Danail
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (05) : 833 - 839
  • [8] DEEP REGRESSION FOREST WITH SOFT-ATTENTION FOR HEAD POSE ESTIMATION
    Ma, Xiangtian
    Sang, Nan
    Wang, Xupeng
    Xiao, Shihua
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2840 - 2844
  • [9] Deep learning-based plane pose regression in obstetric ultrasound
    Chiara Di Vece
    Brian Dromey
    Francisco Vasconcelos
    Anna L. David
    Donald Peebles
    Danail Stoyanov
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 833 - 839
  • [10] Cascaded Pose Regression
    Dollar, Piotr
    Welinder, Peter
    Perona, Pietro
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1078 - 1085