GRAPH ATTENTION CONVOLUTIONAL NETWORK FOR 3D HUMAN POSE AND SHAPE ESTIMATION FROM POINT CLOUDS

被引：1

作者：

Fan, Yung-Wei ^{[1
]}

Huang, Sheng-Chun ^{[1
]}

Chien, Shao-Yi ^{[1
]}

机构：

[1] Natl Taiwan Univ, Taipei, Taiwan

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年

关键词：

3D human pose and shape estimation; 3D human pose estimation; point clouds; GCNNs; Transformer;

D O I：

10.1109/ICME57554.2024.10688353

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Graph Attention Convolutional Network for 3D human pose and shape estimation. Unlike most deep-learning methods that utilize RGB images as input, we opt for 3D data, believing it can convey richer information. Our method comprises two-stage models. The first, named the Local Joint Network (LJN), employs grouping techniques to gather points and predict 3D joints. The second is Graph Attention Convolutional Network, which takes 3D joints as input, leveraging a combination of Graph Convolutional Neural Networks (GCNNs) and Transformers. The key advantage lies in its consideration of both local and non-local interactions. We acquire point clouds using synthetic data and a Kinect v2 camera. Additionally, for RGB images lacking 3D information, we introduce a Point Cloud Generation System capable of synthesizing 3D data. To the best of our knowledge, we are the first to apply this mechanism to this field.

引用

页数：6

共 27 条

[1] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].

Bogo, Federica ;

Kanazawa, Angjoo ;

Lassner, Christoph ;

Gehler, Peter ;

Romero, Javier ;

Black, Michael J. .

COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578

[2]

Bradski G, 2000, DR DOBBS J, V25, P120

[3] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[4]

Cheng Yu, 2020, ARXIV

[5] Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose [J].

Choi, Hongsuk ;

Moon, Gyeongsik ;

Lee, Kyoung Mu .

COMPUTER VISION - ECCV 2020, PT VII, 2020, 12352 :769-787

[6] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[7]

Dhillon IS, 2007, IEEE T PATTERN ANAL, V29, P1944, DOI 10.1109/TP'AMI.2007.1115

[8] DensePose: Dense Human Pose Estimation In The Wild [J].

Guler, Riza Alp ;

Neverova, Natalia ;

Kokkinos, Lasonas .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7297-7306

[9] Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments [J].

Ionescu, Catalin ;

Papava, Dragos ;

Olaru, Vlad ;

Sminchisescu, Cristian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) :1325-1339

[10] Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos [J].

Jafarian, Yasamin ;

Park, Hyun Soo .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12748-12757

← 1 2 3 →