Investigating Depth Domain Adaptation for Efficient Human Pose Estimation

被引：0

作者：

Martinez-Gonzalez, Angel ^{[1
,2
]}

Villamizar, Michael ^{[1
]}

Canevet, Olivier ^{[1
]}

Odobez, Jean-Marc ^{[1
,2
]}

机构：

[1] Idiap Res Inst, Martigny, Switzerland

[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II | 2019年 / 11130卷

基金：

欧盟地平线“2020”;

关键词：

Human pose estimation; Adversarial learning; Domain adaptation; Machine learning;

D O I：

10.1007/978-3-030-11012-3_28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional Neural Networks (CNN) are the leading models for human body landmark detection from RGB vision data. However, as such models require high computational load, an alternative is to rely on depth images which, due to their more simple nature, can allow the use of less complex CNNs and hence can lead to a faster detector. As learning CNNs from scratch requires large amounts of labeled data, which are not always available or expensive to obtain, we propose to rely on simulations and synthetic examples to build a large training dataset with precise labels. Nevertheless, the final performance on real data will suffer from the mismatch between the training and test data, also called domain shift between the source and target distributions. Thus in this paper, our main contribution is to investigate the use of unsupervised domain adaptation techniques to fill the gap in performance introduced by these distribution differences. The challenge lies in the important noise differences (not only gaussian noise, but many missing values around body limbs) between synthetic and real data, as well as the fact that we address a regression task rather than a classification one. In addition, we introduce a new public dataset of synthetically generated depth images to cover the cases of multi-person pose estimation. Our experiments show that domain adaptation provides some improvement, but that further network fine-tuning with real annotated data is worth including to supervise the adaptation process.

引用

页码：346 / 363

页数：18

共 50 条

[31] A Survey on Depth Ambiguity of 3D Human Pose Estimation
Zhang, Siqi
Wang, Chaofang
Dong, Wenlong
Fan, Bin
APPLIED SCIENCES-BASEL, 2022, 12 (20):
[32] Learning Domain Invariant Features for Unsupervised Indoor Depth Estimation Adaptation
Zhang, Jiehua
Li, Liang
Yan, Chenggang
Wang, Zhan
Xu, Changliang
Zhang, Jiyong
Chen, Chuqiao
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
[33] EFFICIENT MODELING BY SELECTING LEARNING SAMPLES IN HUMAN POSE ESTIMATION
Ukita, Norimichi
Matsuyama, Yoichi
Hagita, Norihiro
2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
[34] Automatic and Efficient Human Pose Estimation for Sign Language Videos
James Charles
Tomas Pfister
Mark Everingham
Andrew Zisserman
International Journal of Computer Vision, 2014, 110 : 70 - 90
[35] A simple and efficient channel MLP on token for human pose estimation
Jianglong Huang
Chaoqun Hong
Rongsheng Xie
Lang Ran
Jialong Qian
International Journal of Machine Learning and Cybernetics, 2025, 16 (5) : 3809 - 3817
[36] LIGHTPOSE: A LIGHTWEIGHT AND EFFICIENT MODEL WITH TRANSFORMER FOR HUMAN POSE ESTIMATION
Liu, Xiyang
Li, Peng
Ni, Ding
Wang, Yan
Xue, Hui
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2674 - 2678
[37] Efficient Spatial-Attention Module for Human Pose Estimation
Tran, Tien-Dat
Vo, Xuan-Thuy
Nguyen, Duy-Linh
Jo, Kang-Hyun
FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 242 - 250
[38] Pixel-Level Domain Adaptation for Real-to-Sim Object Pose Estimation
Qian, Kun
Duan, Yanhui
Luo, Chaomin
Zhao, Yongqiang
Jing, Xingshuo
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1618 - 1627
[39] Automatic and Efficient Human Pose Estimation for Sign Language Videos
Charles, James
Pfister, Tomas
Everingham, Mark
Zisserman, Andrew
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 110 (01) : 70 - 90
[40] PoseAnalyser: A Survey on Human Pose Estimation
Kulkarni S.
Deshmukh S.
Fernandes F.
Patil A.
Jabade V.
SN Computer Science, 4 (2)

← 1 2 3 4 5 →