Expressive Forecasting of 3D Whole-Body Human Motions

Citations: 0
Authors
Ding, Pengxiang [1, 2]
Cui, Qiongjie [3, 5]
Wang, Haofan
Zhang, Min [1, 2]
Liu, Mengyuan [4]
Wang, Donglin [1]
Affiliations
[1] Westlake Univ, MiLAB, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
[3] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[4] Peking Univ, Shenzhen Grad Sch, Beijing, Peoples R China
[5] Xiaohongshu Inc, Shanghai, Peoples R China
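Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract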
Human motion forecasting, with the goal of estimating future human behavior over a period of time, is a fundamental task in many real-world applications. However, existing works typically concentrate on predicting the major joints of the human body without considering the delicate movements of the human hands. In practical applications, hand gesture plays an important role in human communication with the real world, and expresses the primary intention of human beings. In this work, we are the first to formulate a whole-body human pose forecasting task, which jointly predicts the future body and hand activities. Correspondingly, we propose a novel Encoding-Alignment-Interaction (EAI) framework that aims to predict both coarse (body joints) and fine-grained (gestures) activities collaboratively, enabling expressive and cross-facilitated forecasting of 3D whole-body human motions. Specifically, our model involves two key constituents: cross-context alignment (XCA) and cross-context interaction (XCI). Considering the heterogeneous information within the whole body, XCA aims to align the latent features of various human components, while XCI focuses on effectively capturing the context interaction among the human components. We conduct extensive experiments on a newly introduced large-scale benchmark and achieve state-of-the-art performance. The code is public for research purposes at https://github.com/Dingpx/EAI.
Pages: 1537-1545
Number of pages: 9