Generating Diverse and Natural 3D Human Motions from Text

Cited by: 150
Authors
Guo, Chuan [1]
Zou, Shihao [1]
Zuo, Xinxin [1]
Wang, Sen [1]
Ji, Wei [1]
Li, Xingyu [1]
Cheng, Li [1]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022
Funding
Natural Sciences and Engineering Research Council of Canada
DOI
10.1109/CVPR52688.2022.00509
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automated generation of 3D human motions from text is a challenging problem. The generated motions are expected to be sufficiently diverse to explore the text-grounded motion space and, more importantly, to accurately depict the content of the prescribed text descriptions. Here we tackle this problem with a two-stage approach: text2length sampling and text2motion generation. Text2length involves sampling from the learned distribution function of motion lengths conditioned on the input text. This is followed by our text2motion module, which uses a temporal variational autoencoder to synthesize a diverse set of human motions of the sampled lengths. Instead of directly engaging with pose sequences, we propose motion snippet code as our internal motion representation, which captures local semantic motion contexts and is empirically shown to facilitate the generation of plausible motions faithful to the input text. Moreover, a large-scale dataset of scripted 3D human motions, HumanML3D, is constructed, consisting of 14,616 motion clips and 44,970 text descriptions.
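The two-stage flow described in the abstract (sample a length first, then generate a motion of that length) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 4-frame length unit, the per-frame Gaussian latent draw, and the toy decoder are all assumptions made for illustration, and the `logits` argument merely stands in for the output of a text-conditioned length model.

```python
import math
import random


def sample_length(logits, rng, unit=4):
    """Stage 1 (text2length), sketched as a categorical draw over length bins.

    `logits` is a hypothetical stand-in for scores produced by a
    text-conditioned length model; bin i maps to (i + 1) * unit frames.
    The 4-frame unit is an assumption, not taken from the abstract.
    """
    peak = max(logits)
    weights = [math.exp(x - peak) for x in logits]  # unnormalized softmax
    bin_index = rng.choices(range(len(logits)), weights=weights, k=1)[0]
    return (bin_index + 1) * unit


def generate_motion(length, latent_dim, pose_dim, rng):
    """Stage 2 (text2motion), reduced to a placeholder.

    Draws one Gaussian latent vector per frame and "decodes" it with a toy
    rule. A real temporal VAE would condition on the text and on earlier
    frames; none of that is modeled here.
    """
    motion = []
    for _ in range(length):
        z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
        pose = [sum(z) / latent_dim] * pose_dim  # toy decoder, assumption
        motion.append(pose)
    return motion


def text_to_motions(logits, num_samples, seed=0):
    """Run both stages several times to obtain motions of varying length."""
    rng = random.Random(seed)
    motions = []
    for _ in range(num_samples):
        length = sample_length(logits, rng)
        motions.append(generate_motion(length, latent_dim=8, pose_dim=3, rng=rng))
    return motions


if __name__ == "__main__":
    samples = text_to_motions([0.1, 2.0, 0.5], num_samples=5)
    print([len(m) for m in samples])  # frame counts of the sampled motions
```

Because the length is drawn before generation, repeated calls for the same text can yield motions of different durations, which is one source of the diversity the abstract refers to.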
Pages: 5142-5151
Number of pages: 10