Keyframe Selection from Motion Capture Sequences with Graph based Deep Reinforcement Learning

被引:0
作者
Mo, Clinton [1 ]
Hu, Kun [1 ]
Mei, Shaohui [2 ]
Chen, Zebin [3 ]
Wang, Zhiyong [4 ]
机构
[1] Univ Sydney, Sydney, NSW, Australia
[2] Northwestern Polytech Univ, Xian, Peoples R China
[3] Elect Arts, Redwood City, CA USA
[4] Univ Sydney, Sydney, NSW, Australia
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
关键词
Keyframe selection; motion capture; keyframe extraction; keyframe animation; reinforcement learning; graph convolutional networks; EXTRACTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Animation production workflows centred around motion capture techniques often require animators to edit the motion for various artistic and technical reasons. This process generally uses a set of keyframes. Unsupervised keyframe selection methods for motion capture sequences are highly demanded to reduce the laborious annotations. However, most existing methods are optimization-based, which cause the issues of flexibility and efficiency and eventually constrains the interactions and controls with animators. To address these limitations, we propose a novel graph based deep reinforcement learning method for efficient unsupervised keyframe selection. First, a reward function is devised in terms of reconstruction difference by comparing the original sequence and the interpolated sequence produced by the keyframes. The reward complies with the requirements of the animation pipeline satisfying: 1) incremental reward to evaluate the interpolated keyframes immediately; 2) order insensitivity for consistent evaluation; and 3) non-diminishing return for comparable rewards between optimal and sub-optimal solutions. Then by representing each skeleton frame as a graph, a graph-based deep agent is guided to heuristically select keyframes to maximize the reward. During the inference it is no longer necessary to estimate the reconstruction difference, and the evaluation time can be reduced significantly. The experimental results on the CMU Mocap dataset demonstrate that our proposed method is able to select keyframes at a high efficiency without clearly compromising the quality in comparison with the state-of-the-art methods.
引用
收藏
页码:5194 / 5202
页数:9
相关论文
共 40 条
[1]  
[Anonymous], 2007, COMPUTER ANIMATION S
[2]  
Apostolidis E., 2019, P 1 INT WORKSH AI SM, P17
[3]   Key Frames Extraction from Human Motion Capture Data Based on Hybrid Particle Swarm Optimization Algorithm [J].
Chang, Xiaojing ;
Yi, Pengfei ;
Zhang, Qiang .
RECENT DEVELOPMENTS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2016, 642 :335-342
[4]   Multiscale motion saliency for keyframe extraction from motion capture sequences [J].
Halit, Cihan ;
Capin, Tolga .
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2011, 22 (01) :3-14
[5]  
Hall D, 1965, ISODATA NOVEL METHOD
[6]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7]   Graph Sequence Recurrent Neural Network for Vision-Based Freezing of Gait Detection [J].
Hu, Kun ;
Wang, Zhiyong ;
Wang, Wei ;
Martens, Kaylena A. Ehgoetz ;
Wang, Liang ;
Tan, Tieniu ;
Lewis, Simon J. G. ;
Feng, David Dagan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :1890-1901
[8]  
Ji Zhong, 2020, IEEE T NEURAL NETWOR
[9]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[10]  
Kipf T. N., 2017, INT C LEARN REPR ICL, DOI [10.1051/0004-6361/201527329, DOI 10.48550/ARXIV.1609.02907]