Convolutional Sequence Generation for Skeleton-Based Action Synthesis

被引：85

作者：

Yan, Sijie ^{[1
]}

Li, Zhizhong ^{[1
]}

Xiong, Yuanjun ^{[1
]}

Yan, Huahan ^{[1
]}

Lin, Dahua ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年

关键词：

D O I：

10.1109/ICCV.2019.00449

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we aim to generate long actions represented as sequences of skeletons. The generated sequences must demonstrate continuous, meaningful human actions, while maintaining coherence among body parts. Instead of generating skeletons sequentially following an autoregressive model, we propose a framework that generates the entire sequence altogether by transforming from a sequence of latent vectors sampled from a Gaussian process (GP). This framework, named Convolutional Sequence Generation Network (CSGN) 1, jointly models structures in temporal and spatial dimensions. It captures the temporal structure at multiple scales through the GP prior and the temporal convolutions; and establishes the spatial connection between the latent vectors and the skeleton graphs via a novel graph refining scheme. It is noteworthy that CSGN allows bidirectional transforms between the latent and the observed spaces, thus enabling semantic manipulation of the action sequences in various forms. We conducted empirical studies on multiple datasets, including a set of high-quality dancing sequences collected by us. The results show that our framework can produce long action sequences that are coherent across time steps and among body parts.

引用

页码：4393 / 4401

页数：9

共 29 条

[1]

[Anonymous], 2016, INT C LEARNING REPRE

[2]

[Anonymous], 2018, AAAI

[3]

[Anonymous], 2017, IEEE T PATTERN ANAL

[4]

[Anonymous], 2018, ARXIV180807371

[5]

[Anonymous], 2014, 3 INT C LEARN REPR

[6]

[Anonymous], 2017, P ADV NEUR INF PROC

[7]

[Anonymous], 2019, CMU GRAPH LAB MOT CA

[8]

[Anonymous], 2014, INT C LEARNING REPRE

[9] HP-GAN: Probabilistic 3D human motion prediction via GAN [J].

Barsoum, Emad ;

Kender, John ;

Liu, Zicheng .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1499-1508

[10] Pros and cons of GAN evaluation measures [J].

Borji, Ali .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 179 :41-65

← 1 2 3 →