View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

Cited by: 0
Authors
Ting Liu
Jennifer J. Sun
Long Zhao
Jiaping Zhao
Liangzhe Yuan
Yuxiao Wang
Liang-Chieh Chen
Florian Schroff
Hartwig Adam
Affiliations
[1] Google Research
[2] California Institute of Technology
[3] Rutgers University
Source
International Journal of Computer Vision, 2022, 130 (01): 111-135
Keywords
Human pose embedding; Probabilistic embedding; View-invariant pose retrieval; Action retrieval; Occlusion robustness
DOI
Not available
CLC number
Subject classification code
Abstract
Recognition of human poses and actions is crucial for autonomous systems to interact smoothly with people. However, cameras generally capture human poses in 2D as images and videos, whose appearance can vary significantly across viewpoints, making recognition challenging. To address this, we explore recognizing similarity in 3D human body poses from 2D information, which has not been well studied in existing work. We propose an approach to learning a compact view-invariant embedding space from 2D body joint keypoints, without explicitly predicting 3D poses. Input ambiguities of 2D poses from projection and occlusion are difficult to represent through a deterministic mapping, so we adopt a probabilistic formulation for our embedding space. Experimental results show that our embedding model achieves higher accuracy than 3D pose estimation models when retrieving similar poses across different camera views. We also show that by training a simple temporal embedding model, we achieve superior performance on pose sequence retrieval and substantially reduce the embedding dimension relative to stacking frame-based embeddings, enabling efficient large-scale retrieval. Furthermore, to enable our embeddings to work with partially visible input, we investigate different keypoint occlusion augmentation strategies during training. We demonstrate that these occlusion augmentations significantly improve retrieval performance on partial 2D input poses. Results on action recognition and video alignment demonstrate that using our embeddings without any additional training achieves competitive performance relative to other models specifically trained for each task.
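The abstract mentions keypoint occlusion augmentation during training but does not specify the strategies used. As a minimal sketch of one plausible variant, assuming each keypoint is masked independently with a fixed probability, the following illustrates the general idea. The function name `occlude_keypoints`, the `drop_prob` parameter, and the choice to zero out masked coordinates are all hypothetical and are not taken from the paper.

```python
import random
from typing import List, Optional, Tuple

Keypoint = Tuple[float, float]


def occlude_keypoints(
    keypoints: List[Keypoint],
    drop_prob: float,
    rng: Optional[random.Random] = None,
) -> Tuple[List[Keypoint], List[int]]:
    """Independently mask each 2D keypoint with probability drop_prob.

    Hypothetical helper for illustration only; the paper's actual
    augmentation strategies are not described in the abstract.
    Returns the keypoints with masked entries set to (0.0, 0.0),
    plus a 0/1 visibility mask of the same length.
    """
    if not 0.0 <= drop_prob <= 1.0:
        raise ValueError("drop_prob must be in [0, 1]")
    rng = rng or random.Random()
    masked: List[Keypoint] = []
    visible: List[int] = []
    for x, y in keypoints:
        # rng.random() is in [0, 1), so drop_prob=0 keeps everything
        # and drop_prob=1 masks everything.
        keep = rng.random() >= drop_prob
        masked.append((x, y) if keep else (0.0, 0.0))
        visible.append(1 if keep else 0)
    return masked, visible


# Example: mask roughly 30% of the keypoints of a toy three-joint pose.
pose = [(0.1, 0.2), (0.4, 0.5), (0.7, 0.8)]
masked_pose, visibility = occlude_keypoints(pose, 0.3, random.Random(0))
```

Returning the visibility mask alongside the coordinates lets a downstream model distinguish a masked keypoint from one that genuinely lies at the origin; whether the authors' model consumes such a mask is an assumption here.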
Pages: 111-135
Page count: 24
Related papers
50 in total
  • [1] View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
    Liu, Ting
    Sun, Jennifer J.
    Zhao, Long
    Zhao, Jiaping
    Yuan, Liangzhe
    Wang, Yuxiao
    Chen, Liang-Chieh
    Schroff, Florian
    Adam, Hartwig
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (01) : 111 - 135
  • [2] Occlusion-Robust Model Learning for Human Pose Estimation
    Kawana, Yuki
    Ukita, Norimichi
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 494 - 498
  • [3] Pose-invariant and occlusion-robust neonatal facial pain assessment
    Zhao, Yisheng
    Zhu, Huaiyu
    Chen, Xiaofei
    Luo, Feixiang
    Li, Mengting
    Zhou, Jinyan
    Chen, Shuohui
    Pan, Yun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [4] Automated Hand-Raising Detection in Classroom Videos: A View-Invariant and Occlusion-Robust Machine Learning Approach
    Buehler, Babette
    Hou, Ruikun
    Bozkir, Efe
    Goldberg, Patricia
    Gerjets, Peter
    Trautwein, Ulrich
    Kasneci, Enkelejda
    ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2023, 2023, 13916 : 102 - 113
  • [5] Occlusion-Robust Pallet Pose Estimation for Warehouse Automation
    Vu, Van-Duc
    Hoang, Dinh-Dai
    Tan, Phan Xuan
    Nguyen, Van-Thiep
    Nguyen, Thu-Uyen
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Vu, Duy-Quang
    Ngo, Phuc-Quan
    Duong, Quang-Tri
    Nguyen, Anh-Nhat
    Hoang, Dinh-Cuong
    IEEE ACCESS, 2024, 12 : 1927 - 1942
  • [6] Occlusion-robust markerless surgical instrument pose estimation
    Xu, Haozheng
    Giannarou, Stamatia
    HEALTHCARE TECHNOLOGY LETTERS, 2024, : 327 - 335
  • [7] Occlusion-Robust Object Pose Estimation with Holistic Representation
    Chen, Bo
    Chin, Tat-Jun
    Klimavicius, Marius
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2223 - 2233
  • [8] View-Invariant Pose Analysis for Human Movement Assessment from RGB Data
    Sardari, Faegheh
    Paiement, Adeline
    Mirmehdi, Majid
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT II, 2019, 11752 : 237 - 248
  • [9] View Transfer on Human Skeleton Pose: Automatically Disentangle the View-Variant and View-Invariant Information for Pose Representation Learning
    Nie, Qiang
    Liu, Yunhui
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (01) : 1 - 22
  • [10] Accurate and occlusion-robust multi-view stereo
    Zhu, Zhaokun
    Stamatopoulos, Christos
    Fraser, Clive S.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2015, 109 : 47 - 61