Generative Autoregressive Networks for 3D Dancing Move Synthesis From Music

被引:36
作者
Ahn, Hyemin [1 ,2 ]
Kim, Jaehun [3 ]
Kim, Kihyun [1 ,2 ]
Oh, Songhwai [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[2] Seoul Natl Univ, ASRI, Seoul 08826, South Korea
[3] Delft Univ Technol, NL-2628 Delft, Netherlands
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2020年 / 5卷 / 02期
关键词
Three-dimensional displays; Generators; Task analysis; Multiple signal classification; Skeleton; Training; Music; Gesture; posture and facial expressions; novel deep learning methods; entertainment robotics;
D O I
10.1109/LRA.2020.2977333
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This letter proposes a framework which is able to generate a sequence of three-dimensional human dance poses for a given music. The proposed framework consists of three components: a music feature encoder, a pose generator, and a music genre classifier. We focus on integrating these components for generating a realistic 3D human dancing move from music, which can be applied to artificial agents and humanoid robots. The trained dance pose generator, which is a generative autoregressive model, is able to synthesize a dance sequence longer than 1,000 pose frames. Experimental results of generated dance sequences from various songs show how the proposed method generates human-like dancing move to a given music. In addition, a generated 3D dance sequence is applied to a humanoid robot, showing that the proposed framework can make a robot to dance just by listening to music.
引用
收藏
页码:3501 / 3508
页数:8
相关论文
共 29 条
  • [1] [Anonymous], POP STARS OPENING CE
  • [2] [Anonymous], 2016, PROC 9 ISCA SPEEC
  • [3] Bai S., 2018, ARXIV
  • [4] Bengio S, 2015, ADV NEUR IN, V28
  • [5] Berndt D, 1994, WORKSH KNOWL DISC DA, V398, P359
  • [6] Bertin-Mahieux Thierry, 2011, P 12 INT C MUS INF R
  • [7] RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS .1. THE METHOD OF PAIRED COMPARISONS
    BRADLEY, RA
    TERRY, ME
    [J]. BIOMETRIKA, 1952, 39 (3-4) : 324 - 345
  • [8] Crnkovic-Friis L., 2016, P 7 INT C COMP CREAT, P272, DOI DOI 10.48550/ARXIV.1605.06921
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]