Investigating Depth Domain Adaptation for Efficient Human Pose Estimation

被引:0
|
作者
Martinez-Gonzalez, Angel [1 ,2 ]
Villamizar, Michael [1 ]
Canevet, Olivier [1 ]
Odobez, Jean-Marc [1 ,2 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
来源
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II | 2019年 / 11130卷
基金
欧盟地平线“2020”;
关键词
Human pose estimation; Adversarial learning; Domain adaptation; Machine learning;
D O I
10.1007/978-3-030-11012-3_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNN) are the leading models for human body landmark detection from RGB vision data. However, as such models require high computational load, an alternative is to rely on depth images which, due to their more simple nature, can allow the use of less complex CNNs and hence can lead to a faster detector. As learning CNNs from scratch requires large amounts of labeled data, which are not always available or expensive to obtain, we propose to rely on simulations and synthetic examples to build a large training dataset with precise labels. Nevertheless, the final performance on real data will suffer from the mismatch between the training and test data, also called domain shift between the source and target distributions. Thus in this paper, our main contribution is to investigate the use of unsupervised domain adaptation techniques to fill the gap in performance introduced by these distribution differences. The challenge lies in the important noise differences (not only gaussian noise, but many missing values around body limbs) between synthetic and real data, as well as the fact that we address a regression task rather than a classification one. In addition, we introduce a new public dataset of synthetically generated depth images to cover the cases of multi-person pose estimation. Our experiments show that domain adaptation provides some improvement, but that further network fine-tuning with real annotated data is worth including to supervise the adaptation process.
引用
收藏
页码:346 / 363
页数:18
相关论文
共 50 条
  • [31] A Survey on Depth Ambiguity of 3D Human Pose Estimation
    Zhang, Siqi
    Wang, Chaofang
    Dong, Wenlong
    Fan, Bin
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [32] Learning Domain Invariant Features for Unsupervised Indoor Depth Estimation Adaptation
    Zhang, Jiehua
    Li, Liang
    Yan, Chenggang
    Wang, Zhan
    Xu, Changliang
    Zhang, Jiyong
    Chen, Chuqiao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
  • [33] EFFICIENT MODELING BY SELECTING LEARNING SAMPLES IN HUMAN POSE ESTIMATION
    Ukita, Norimichi
    Matsuyama, Yoichi
    Hagita, Norihiro
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [34] Automatic and Efficient Human Pose Estimation for Sign Language Videos
    James Charles
    Tomas Pfister
    Mark Everingham
    Andrew Zisserman
    International Journal of Computer Vision, 2014, 110 : 70 - 90
  • [35] A simple and efficient channel MLP on token for human pose estimation
    Jianglong Huang
    Chaoqun Hong
    Rongsheng Xie
    Lang Ran
    Jialong Qian
    International Journal of Machine Learning and Cybernetics, 2025, 16 (5) : 3809 - 3817
  • [36] LIGHTPOSE: A LIGHTWEIGHT AND EFFICIENT MODEL WITH TRANSFORMER FOR HUMAN POSE ESTIMATION
    Liu, Xiyang
    Li, Peng
    Ni, Ding
    Wang, Yan
    Xue, Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2674 - 2678
  • [37] Efficient Spatial-Attention Module for Human Pose Estimation
    Tran, Tien-Dat
    Vo, Xuan-Thuy
    Nguyen, Duy-Linh
    Jo, Kang-Hyun
    FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 242 - 250
  • [38] Pixel-Level Domain Adaptation for Real-to-Sim Object Pose Estimation
    Qian, Kun
    Duan, Yanhui
    Luo, Chaomin
    Zhao, Yongqiang
    Jing, Xingshuo
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1618 - 1627
  • [39] Automatic and Efficient Human Pose Estimation for Sign Language Videos
    Charles, James
    Pfister, Tomas
    Everingham, Mark
    Zisserman, Andrew
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 110 (01) : 70 - 90
  • [40] PoseAnalyser: A Survey on Human Pose Estimation
    Kulkarni S.
    Deshmukh S.
    Fernandes F.
    Patil A.
    Jabade V.
    SN Computer Science, 4 (2)