Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition

被引:5
|
作者
Hu, Xiaodan [1 ]
Ahuja, Narendra [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Champaign, IL 61820 USA
基金
美国食品与农业研究所;
关键词
D O I
10.1109/ICCV48922.2021.01083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dance experts often view dance as a hierarchy of information, spanning low-level (raw images, image sequences), mid-levels (human poses and bodypart movements), and high-level (dance genre). We propose a Hierarchical Dance Video Recognition framework (HDVR). HDVR estimates 2D pose sequences, tracks dancers, and then simultaneously estimates corresponding 3D poses and 3D-to-2D imaging parameters, without requiring ground truth for 3D poses. Unlike most methods that work on a single person, our tracking works on multiple dancers, under occlusions. From the estimated 3D pose sequence, HDVR extracts body part movements, and therefrom dance genre. The resulting hierarchical dance representation is explainable to experts. To overcome noise and interframe correspondence ambiguities, we enforce spatial and temporal motion smoothness and photometric continuity over time. We use an LSTM network to extract 3D movement subsequences from which we recognize dance genre. For experiments, we have identified 154 movement types, of 16 body parts, and assembled a new University of Illinois Dance (UID) Dataset, containing 1143 video clips of 9 genres covering 30 hours, annotated with movement and genre labels. Our experimental results demonstrate that our algorithms outperform the state-of-the-art 3D pose estimation methods, which also enhances our dance recognition performance.
引用
收藏
页码:10995 / 11004
页数:10
相关论文
共 50 条
  • [21] Field testing of a 3D target recognition and pose estimation algorithm
    Ruel, S
    English, C
    Melo, L
    Berube, A
    Aikman, D
    Deslauriers, A
    Church, P
    Maheux, J
    AUTOMATIC TARGET RECOGNITION XIV, 2004, 5426 : 102 - 111
  • [22] 3D Log Recognition and Pose Estimation for Robotic Forestry Machine
    Park, Yeonchool
    Shiriaev, Anton
    Westerberg, Simon
    Lee, Sukhan
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [23] 3D object recognition and pose estimation using kernel PCA
    Zhao, LW
    Luo, SW
    Liao, LZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3258 - 3262
  • [24] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
    Gu, Renshu
    Wang, Gaoang
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
  • [25] Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation
    Yu, Zhenbo
    Ni, Bingbing
    Xu, Jingwei
    Wang, Junjie
    Zhao, Chenglong
    Zhang, Wenjun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8631 - 8640
  • [26] Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation
    Yang, Yuchen
    Qiao, Yu
    Sun, Xiao
    COMPUTER VISION-ECCV 2024, PT XLIV, 2025, 15102 : 38 - 55
  • [27] Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation
    Kundu, Jogendra Nath
    Seth, Siddharth
    Rahul, M., V
    Rakesh, Mugalodi
    Babu, R. Venkatesh
    Chakraborty, Anirban
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11312 - 11319
  • [28] OCR-Pose: Occlusion-aware Contrastive Representation for Unsupervised 3D Human Pose Estimation
    Wang, Junjie
    Yu, Zhenbo
    Tong, Zhengyan
    Wang, Hang
    Liu, Jinxian
    Zhang, Wenjun
    Wu, Xiaoyan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5477 - 5485
  • [29] LOCAL TO GLOBAL TRANSFORMER FOR VIDEO BASED 3D HUMAN POSE ESTIMATION
    Ma, Haifeng
    Ke Lu
    Xue, Jian
    Niu, Zehai
    Gao, Pengcheng
    2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
  • [30] Occlusion-Aware Networks for 3D Human Pose Estimation in Video
    Cheng, Yu
    Yang, Bo
    Wang, Bo
    Yan, Wending
    Tan, Robby T.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 723 - 732