Generative Autoregressive Networks for 3D Dancing Move Synthesis From Music

被引：36

作者：

Ahn, Hyemin ^{[1
,2
]}

Kim, Jaehun ^{[3
]}

Kim, Kihyun ^{[1
,2
]}

Oh, Songhwai ^{[1
,2
]}

机构：

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea

[2] Seoul Natl Univ, ASRI, Seoul 08826, South Korea

[3] Delft Univ Technol, NL-2628 Delft, Netherlands

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2020年 / 5卷 / 02期

关键词：

Three-dimensional displays; Generators; Task analysis; Multiple signal classification; Skeleton; Training; Music; Gesture; posture and facial expressions; novel deep learning methods; entertainment robotics;

D O I：

10.1109/LRA.2020.2977333

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

This letter proposes a framework which is able to generate a sequence of three-dimensional human dance poses for a given music. The proposed framework consists of three components: a music feature encoder, a pose generator, and a music genre classifier. We focus on integrating these components for generating a realistic 3D human dancing move from music, which can be applied to artificial agents and humanoid robots. The trained dance pose generator, which is a generative autoregressive model, is able to synthesize a dance sequence longer than 1,000 pose frames. Experimental results of generated dance sequences from various songs show how the proposed method generates human-like dancing move to a given music. In addition, a generated 3D dance sequence is applied to a humanoid robot, showing that the proposed framework can make a robot to dance just by listening to music.

引用

页码：3501 / 3508

页数：8

共 29 条

[1] [Anonymous], POP STARS OPENING CE
[2] [Anonymous], 2016, PROC 9 ISCA SPEEC
[3] Bai S., 2018, ARXIV
[4] Bengio S, 2015, ADV NEUR IN, V28
[5] Berndt D, 1994, WORKSH KNOWL DISC DA, V398, P359
[6] Bertin-Mahieux Thierry, 2011, P 12 INT C MUS INF R
[7] RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS .1. THE METHOD OF PAIRED COMPARISONS
BRADLEY, RA
TERRY, ME
[J]. BIOMETRIKA, 1952, 39 (3-4) : 324 - 345
[8] Crnkovic-Friis L., 2016, P 7 INT C COMP CREAT, P272, DOI DOI 10.48550/ARXIV.1605.06921
[9] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[10] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]

← 1 2 3 →