Efficient Hyperparameter Optimization for Physics-based Character Animation

被引:3
作者
Yang, Zeshi [1 ]
Yin, Zhiqi [1 ]
机构
[1] Simon Fraser Univ, Burnaby, BC, Canada
关键词
Physics-based Character Animation; Bayesian Optimization; Reinforcement Learning; Curriculum Learning; DESIGN;
D O I
10.1145/3451254
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Physics-based character animation has seen significant advances in recent years with the adoption of Deep Reinforcement Learning (DRL). However, DRL-based learning methods are usually computationally expensive and their performance crucially depends on the choice of hyperparameters. Tuning hyperparameters for these methods often requires repetitive training of control policies, which is even more computationally prohibitive. In this work, we propose a novel Curriculum-based Multi-Fidelity Bayesian Optimization framework (CMFBO) for efficient hyperparameter optimization of DRL-based character control systems. Using curriculum-based task difficulty as fidelity criterion, our method improves searching efficiency by gradually pruning search space through evaluation on easier motor skill tasks. We evaluate our method on two physics-based character control tasks: character morphology optimization and hyperparameter tuning of DeepMimic. Our algorithm significantly outperforms state-of-the-art hyperparameter optimization methods applicable for physics-based character animation. In particular, we show that hyperparameters optimized through our algorithm result in at least 5x efficiency gain comparing to author-released settings in DeepMimic.
引用
收藏
页数:19
相关论文
共 89 条
[21]   Reinforcement Learning for Improving Agent Design [J].
Ha, David .
ARTIFICIAL LIFE, 2019, 25 (04) :352-365
[22]  
Ha S, 2017, ROBOTICS: SCIENCE AND SYSTEMS XIII
[23]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[24]   Online Control of Simulated Humanoids Using Particle Belief Propagation [J].
Hamalainen, Perttu ;
Rajamaki, Joose ;
Liu, C. Karen .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04)
[25]  
Hansen N, 2006, STUD FUZZ SOFT COMP, V192, P75
[26]  
Hodgins J. K., 1995, Computer Graphics Proceedings. SIGGRAPH 95, P71, DOI 10.1145/218380.218414
[27]  
Huang Wenlong, 2020, PR MACH LEARN RES, V119
[28]  
Hutter F, 2013, 23 INT JOINT C ART I
[29]   Optimization-Based Interactive Motion Synthesis [J].
Jain, Sumit ;
Ye, Yuting ;
Liu, C. Karen .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (01)
[30]  
Jaquier Noemie, 2020, C ROB LEARN, P233