Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

被引:3
作者
Niu, Zehai [1 ]
Lu, Ke [1 ,2 ]
Xue, Jian [1 ]
Wang, Jinbao [3 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Vanke Cloud City Phase I Bldg 8,Xili St, Shenzhen 518055, Guangdong, Peoples R China
[3] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[4] Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
关键词
3D human pose estimation; Motion capture; Deep learning;
D O I
10.1016/j.cviu.2024.104059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multi -view 3D human pose estimation task relies on 2D human pose estimation for each view; however, severe occlusion, truncation, and human interaction lead to incorrect 2D human pose estimation for some views. The traditional "Matching-Lifting-Tracking"paradigm amplifies the incorrect 2D human pose into an incorrect 3D human pose, which significantly challenges the robustness of multi -view 3D human pose estimation. In this paper, we propose a novel method that tackles the inherent difficulties of the traditional paradigm. This method is rooted in the newly devised "Skeleton Pooling -Clustering -Tracking (SPCT)"paradigm. It initiates a 2D human pose estimation for each perspective. Then a symmetrical dilated network is created for skeleton pool estimation. Upon clustering the skeleton pool, we introduce and implement an innovative tracking method that is explicitly designed for the SPCT paradigm. The tracking method refines and filters the skeleton clusters, thereby enhancing the robustness of the multi -person 3D human pose estimation results. By coupling the skeleton pool with the tracking refinement process, our method obtains high -quality multi -person 3D human pose estimation results despite severe occlusions that produce erroneous 2D and 3D estimates. By employing the proposed SPCT paradigm and a computationally efficient network architecture, our method outperformed existing approaches regarding robustness on the Shelf, 4D Association, and CMU Panoptic datasets, and could be applied in practical scenarios such as markerless motion capture and animation production.
引用
收藏
页数:13
相关论文
共 48 条
  • [1] 3D Pictorial Structures for Multiple Human Pose Estimation
    Belagiannis, Vasileios
    Amin, Sikandar
    Andriluka, Mykhaylo
    Schiele, Bernt
    Navab, Nassir
    Ilic, Slobodan
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1669 - 1676
  • [2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [3] Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
    Chen, Long
    Ai, Haizhou
    Chen, Rui
    Zhuang, Zijie
    Liu, Shuang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3276 - 3285
  • [4] TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting
    Choudhury, Rohan
    Kitani, Kris M.
    Jeni, Laszlo A.
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14704 - 14714
  • [5] Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking
    Chu, Hau
    Lee, Jia-Hong
    Lee, Yao-Chih
    Hsu, Ching-Hsien
    Li, Jia-Da
    Chen, Chu-Song
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1472 - 1481
  • [6] A space-sweep approach to true multi-image matching
    Collins, RT
    [J]. 1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, : 358 - 363
  • [7] Congzhentao Huang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12373), P477, DOI 10.1007/978-3-030-58604-1_29
  • [8] Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views
    Dong, Junting
    Fang, Qi
    Jiang, Wen
    Yang, Yurou
    Huang, Qixing
    Bao, Hujun
    Zhou, Xiaowei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6981 - 6992
  • [9] Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views
    Dong, Junting
    Jiang, Wen
    Huang, Qixing
    Bao, Hujun
    Zhou, Xiaowei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7784 - 7793
  • [10] Shape-aware Multi-Person Pose Estimation from Multi-View Images
    Dong, Zijian
    Song, Jie
    Chen, Xu
    Guo, Chen
    Hilliges, Otmar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11138 - 11148