Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

Cited by: 0

Authors:

Sun, Cheng [1]

Thomas, Diego [1]

Kawasaki, Hiroshi [1]

Affiliations:

[1] Kyushu Univ, Fukuoka, Japan

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

DOI:

10.1109/ICPR48806.2021.9412270

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

3D human pose estimation from a single 2D video is an extremely difficult task because computing 3D geometry from 2D images is an ill-posed problem. Recent popular solutions adopt a fully supervised learning strategy, which requires training a deep network on a large-scale ground-truth dataset of 3D poses and 2D images. However, no such large-scale dataset of natural images exists, which limits the usability of existing methods. While building a complete 3D dataset is tedious and expensive, abundant 2D in-the-wild data is already publicly available. As a consequence, there is growing interest in the computer vision community in designing efficient techniques that use an unsupervised learning strategy, which does not require any ground-truth 3D data. Such methods can be trained with only natural 2D images of humans. In this paper, we propose an unsupervised method for estimating 3D human pose in videos. The standard approach for unsupervised learning is to use the Generative Adversarial Network (GAN) framework. To improve the performance of 3D human pose estimation in videos, we propose a new GAN that enforces body consistency across the frames of a video. We evaluate the efficiency of our proposed method on a public 3D human body dataset.

Pages: 5959-5964

Page count: 6
