Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition

被引:5
|
作者
Hu, Xiaodan [1 ]
Ahuja, Narendra [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Champaign, IL 61820 USA
基金
美国食品与农业研究所;
关键词
D O I
10.1109/ICCV48922.2021.01083
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dance experts often view dance as a hierarchy of information, spanning low-level (raw images, image sequences), mid-levels (human poses and bodypart movements), and high-level (dance genre). We propose a Hierarchical Dance Video Recognition framework (HDVR). HDVR estimates 2D pose sequences, tracks dancers, and then simultaneously estimates corresponding 3D poses and 3D-to-2D imaging parameters, without requiring ground truth for 3D poses. Unlike most methods that work on a single person, our tracking works on multiple dancers, under occlusions. From the estimated 3D pose sequence, HDVR extracts body part movements, and therefrom dance genre. The resulting hierarchical dance representation is explainable to experts. To overcome noise and interframe correspondence ambiguities, we enforce spatial and temporal motion smoothness and photometric continuity over time. We use an LSTM network to extract 3D movement subsequences from which we recognize dance genre. For experiments, we have identified 154 movement types, of 16 body parts, and assembled a new University of Illinois Dance (UID) Dataset, containing 1143 video clips of 9 genres covering 30 hours, annotated with movement and genre labels. Our experimental results demonstrate that our algorithms outperform the state-of-the-art 3D pose estimation methods, which also enhances our dance recognition performance.
引用
收藏
页码:10995 / 11004
页数:10
相关论文
共 50 条
  • [31] Self-supervised 3D human pose estimation from video
    Gholami, Mohsen
    Rezaei, Ahmad
    Rhodin, Helge
    Ward, Rabab
    Wang, Z. Jane
    NEUROCOMPUTING, 2022, 488 : 97 - 106
  • [32] Stabilization of 3D pose estimation
    Neddermeyer, W
    Schnell, M
    Winkler, W
    Lilienthal, A
    APPLICATIONS OF GEOMETRIC ALGEBRA IN COMPUTER SCIENCE AND ENGINEERING, 2002, : 385 - 394
  • [33] 2D Action Recognition Serves 3D Human Pose Estimation
    Gall, Juergen
    Yao, Angela
    Van Gool, Luc
    COMPUTER VISION-ECCV 2010, PT III, 2010, 6313 : 425 - 438
  • [34] Development of a practical 3D automatic target recognition and pose estimation algorithm
    English, C
    Ruel, S
    Melo, L
    Church, P
    Maheux, J
    AUTOMATIC TARGET RECOGNITION XIV, 2004, 5426 : 112 - 123
  • [35] A novel method of target recognition and 3D pose estimation in unstructured environment
    Ren B.
    Wei K.
    Dai Y.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2019, 51 (01): : 38 - 44
  • [36] 3D head pose estimation using range images for face recognition
    Song, H
    Sohn, K
    2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 1256 - 1261
  • [37] BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR
    Pramatarov, Georgi
    De Martini, Daniele
    Gadd, Matthew
    Newman, Paul
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 7004 - 7011
  • [38] HDPose: Post-Hierarchical Diffusion with Conditioning for 3D Human Pose Estimation
    Lee, Donghoon
    Kim, Jaeho
    SENSORS, 2024, 24 (03)
  • [39] A visual quality inspection system based on a hierarchical 3D pose estimation algorithm
    von Bank, C
    Gavrila, DM
    Wöhler, C
    PATTERN RECOGNITION, PROCEEDINGS, 2003, 2781 : 179 - 186
  • [40] Cascaded Hierarchical CNN for RGB-Based 3D Hand Pose Estimation
    Dai, Shiming
    Liu, Wei
    Yang, Wenji
    Fan, Lili
    Zhang, Jihao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020