Parallel-branch network for 3D human pose and shape estimation in video

被引：5

作者：

Wu, Yuanhao ^{[1
]}

Wang, Chenxing ^{[2
]}

机构：

[1] Southeast Univ, Suzhou Res Inst, Nanjing, Peoples R China

[2] Southeast Univ, Nanjing, Peoples R China

来源：

COMPUTER ANIMATION AND VIRTUAL WORLDS | 2022年 / 33卷 / 3-4期

关键词：

human pose estimation; parallel networks; transformer;

D O I：

10.1002/cav.2078

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Human pose and shape estimation have developed rapidly, where a skinned multi-person linear (SMPL) approach performs excellent recently. However, the prior template of the human body in the SMPL model is fixed, thus a deviation may be resulted in the reconstructed body shape if a human body acts sharp movements such as sporting or dancing. To address this problem, we propose a parallel-branch network including a designed spatial-temporal (ST) branch and a SMPL branch. The ST branch essentially performs the 2D-to-3D lifting for more accurate joint prediction, by the designed spatial transformer and temporal transformer. The 3D joints from the ST branch are used to supervise the 3D joints from the SMPL branch and further correct the deviation of the SMPL model. Experiments on some popular benchmarks like 3DPW and MPI-INF-3DHP show that our method has better performance than other methods with video input. Our code is available at

引用

页数：10

共 36 条

[1] PoseTrack: A Benchmark for Human Pose Estimation and Tracking [J].

Andriluka, Mykhaylo ;

Iqbal, Umar ;

Insafutdinov, Eldar ;

Pishchulin, Leonid ;

Milan, Anton ;

Gall, Juergen ;

Schiele, Bernt .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5167-5176

[2] 3D Human Pose Estimation via Deep Learning from 2D annotations [J].

Brau, Ernesto ;

Jiang, Hao .

PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :582-591

[3] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[4] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[5] HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation [J].

Cheng, Bowen ;

Xiao, Bin ;

Wang, Jingdong ;

Shi, Honghui ;

Huang, Thomas S. ;

Zhang, Lei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5385-5394

[6] 3D Human Pose Estimation=2D Pose Estimation plus Matching [J].

Chen, Ching-Hang ;

Ramanan, Deva .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5759-5767

[7] Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video [J].

Choi, Hongsuk ;

Moon, Gyeongsik ;

Chang, Ju Yong ;

Lee, Kyoung Mu .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1964-1973

[8] Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose [J].

Choi, Hongsuk ;

Moon, Gyeongsik ;

Lee, Kyoung Mu .

COMPUTER VISION - ECCV 2020, PT VII, 2020, 12352 :769-787

[9]

Chung Junyoung, 2014, ARXIV

[10]

Dosovitskiy A, 2020, ARXIV

← 1 2 3 4 →