Enhancing egocentric 3D pose estimation with third person views

被引:4
|
作者
Dhamanaskar, Ameya [1 ]
Dimiccoli, Mariella [1 ]
Corona, Enric [1 ]
Pumarola, Albert [1 ]
Moreno-Noguer, Francesc [1 ]
机构
[1] UPC, CSIC, Inst Robot & Informat Ind, Carrer Llorens & Artigas 4-6, Barcelona 08028, Spain
关键词
3D pose estimation; Self -supervised learning; Egocentric vision;
D O I
10.1016/j.patcog.2023.109358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel approach to enhance the 3D body pose estimation of a person computed from videos captured from a single wearable camera. The main technical contribution consists of leveraging high-level features linking first-and third-views in a joint embedding space. To learn such embedding space we introduce First2Third-Pose, a new paired synchronized dataset of nearly 20 0 0 videos depicting human activities captured from both first-and third-view perspectives. We explicitly consider spatial -and motion-domain features, combined using a semi-Siamese architecture trained in a self-supervised fashion. Experimental results demonstrate that the joint multi-view embedded space learned with our dataset is useful to extract discriminatory features from arbitrary single-view egocentric videos, with no need to perform any sort of domain adaptation or knowledge of camera parameters. An extensive evalu-ation demonstrates that we achieve significant improvement in egocentric 3D body pose estimation per-formance on two unconstrained datasets, over three supervised state-of-the-art approaches. The collected dataset and pre-trained model are available for research purposes.1 (c) 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Estimating Egocentric 3D Human Pose in Global Space
    Wang, Jian
    Liu, Lingjie
    Xu, Weipeng
    Sarkar, Kripasindhu
    Theobalt, Christian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11480 - 11489
  • [22] Light3DPose: Real-time Multi-Person 3D Pose Estimation from Multiple Views
    Elmi, Alessi
    Mazzini, Davide
    Tortella, Pietro
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2755 - 2762
  • [23] Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
    Fabbri, Matteo
    Lanzi, Fabio
    Calderara, Simone
    Alletto, Stefano
    Cucchiara, Rita
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7202 - 7211
  • [24] Hand PointNet-based 3D Hand Pose Estimation in Egocentric RGB-D Images
    Le, Van-Hung
    Hoang, Van-Nam
    Vu, Hai
    Le, Thi-Lan
    Tran, Thanh-Hai
    Vu, Viet-Vu
    PROCEEDINGS OF 202013TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2020), 2020, : 215 - 220
  • [25] A Real-time Multi-Person 3D Pose Estimation System from Multiple RGB-D Views for Live Streaming of 3D Animation
    Hwang, Taemin
    Kim, Jieun
    Kim, Myoungjin
    Kim, Minjoon
    COMPANION PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 COMPANION, 2023, : 105 - 107
  • [26] 3D Hand Pose Detection in Egocentric RGB-D Images
    Rogez, Gregory
    Khademi, Maryam
    Supancic, J. S., III
    Montiel, J. M. M.
    Ramanan, Deva
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 356 - 371
  • [27] Automatic Calibration of the Fisheye Camera for Egocentric 3D Human Pose Estimation from a Single Image
    Zhang, Yahui
    You, Shaodi
    Gevers, Theo
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1771 - 1780
  • [28] Person-in-WiFi 3D: End-to-End Multi-Person 3D Pose Estimation with Wi-Fi
    Yan, Kangwei
    Wang, Fei
    Qian, Bo
    Ding, Han
    Han, Jinsong
    Wei, Xing
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 969 - 978
  • [29] Stabilization of 3D pose estimation
    Neddermeyer, W
    Schnell, M
    Winkler, W
    Lilienthal, A
    APPLICATIONS OF GEOMETRIC ALGEBRA IN COMPUTER SCIENCE AND ENGINEERING, 2002, : 385 - 394
  • [30] Multi-person 3D Pose Estimation from Monocular Image Sequences
    Li, Ran
    Xu, Nayun
    Lu, Xutong
    Xing, Yucheng
    Zhao, Haohua
    Niu, Li
    Zhang, Liqing
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 15 - 24