Implicit Neural Representations for Variable Length Human Motion Generation

被引:34
作者
Cervantes, Pablo [1 ]
Sekikawa, Yusuke [2 ]
Sato, Ikuro [1 ,2 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
[2] Denso IT Lab Inc, Tokyo, Japan
来源
COMPUTER VISION - ECCV 2022, PT XVII | 2022年 / 13677卷
关键词
Motion generation; Implicit Neural Representations;
D O I
10.1007/978-3-031-19790-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an action-conditional human motion generation method using variational implicit neural representations (INR). The variational formalism enables action-conditional distributions of INRs, from which one can easily sample representations to generate novel human motion sequences. Our method offers variable-length sequence generation by construction because a part of INR is optimized for a whole sequence of arbitrary length with temporal embeddings. In contrast, previous works reported difficulties with modeling variable-length sequences. We confirm that our method with a Transformer decoder outperforms all relevant methods on HumanAct12, NTU-RGBD, and UESTC datasets in terms of realism and diversity of generated motions. Surprisingly, even our method with an MLP decoder consistently outperforms the state-of-the-art Transformer-based auto-encoder. In particular, we show that variable-length motions generated by our method are better than fixedlength motions generated by the state-of-the-art method in terms of realism and diversity. Code at https://github.com/PACerv/ImplicitMotion.
引用
收藏
页码:356 / 372
页数:17
相关论文
共 41 条
[1]   Image Generators with Conditionally-Independent Pixel Synthesis [J].
Anokhin, I ;
Demochkin, K. ;
Khakhulin, T. ;
Sterkin, G. ;
Lempitsky, V ;
Korzhenkov, D. .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14273-14282
[2]  
[Anonymous], 2021, NeurIPS
[3]  
[Anonymous], 2014, INT C LEARNING REPRE
[4]   HP-GAN: Probabilistic 3D human motion prediction via GAN [J].
Barsoum, Emad ;
Kender, John ;
Liu, Zicheng .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1499-1508
[5]   GlocalNet: Class-aware Long-term Human Motion Synthesis [J].
Battan, Neeraj ;
Agrawal, Yudhik ;
Rao, Sai Soorya ;
Goel, Aman ;
Sharma, Avinash .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :878-887
[6]   Deep representation learning for human motion prediction and classification [J].
Butepage, Judith ;
Black, Michael J. ;
Kragic, Danica ;
Kjellstrom, Hedvig .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1591-1599
[7]  
Chen Y., 2020, BMVC
[8]   Learning Implicit Fields for Generative Shape Modeling [J].
Chen, Zhiqin ;
Zhang, Hao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941
[9]  
DeVries T., 2021, ICCV, P14304
[10]  
Doersch C, 2019, ADV NEUR IN, V32