A diffusion wavelets-based multiscale framework for inverse optimal control of stochastic systems

被引:0
作者
Ha, Jung-Su [1 ,2 ]
Chae, Hyeok-Joo [3 ,4 ]
Choi, Han-Lim [3 ,4 ]
机构
[1] Tech Univ Berlin, Learning & Intelligent Syst Lab, Berlin, Germany
[2] Max Planck Inst Intelligent Syst, Stuttgart, Germany
[3] Korea Adv Inst Sci & Technol, Dept Aerosp Engn, 291 Daehak Ro, Daejeon, South Korea
[4] Korea Adv Inst Sci & Technol, KI Robot, 291 Daehak Ro, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
Inverse optimal control; diffusion wavelets; multiresolution analysis;
D O I
10.1080/00207721.2021.1882011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work presents a multiscale framework to solve a class of inverse optimal control (IOC) problems in the context of robot motion planning and control in a complex environment. In order to handle complications resulting from a large decision space and complex environmental geometry, two key concepts are adopted: (a) a diffusion wavelet representation of the Markov chain for hierarchical abstraction of the state space; and (b) a desirability function-based representation of the Markov decision process (MDP) to efficiently calculate the optimal policy. An IOC problem constructed on a 'abstract state' is solved, which is much more tractable than using the original bases set; moreover, the solution can be obtained recursively in the 'coarse to fine' direction by utilizing the hierarchical structure of basis functions. The resulting multiscale plan is utilized to finally compute a continuous-time optimal control policy within a receding horizon implementation. Illustrative numerical experiments on a robot path control in a complex environment and on a quadrotor ball-catching task are presented to verify the proposed method.
引用
收藏
页码:2228 / 2240
页数:13
相关论文
共 28 条
[1]  
[Anonymous], 2004, Introduction to machine learning
[2]  
[Anonymous], 1986, Introduction to applied mathematics
[3]  
Choi H.L., 2017, IEEE INT C ROB AUT I, DOI [10.1109/icra.2017.7989085, DOI 10.1109/ICRA.2017.7989085]
[4]   Diffusion wavelets [J].
Coifman, Ronald R. ;
Maggioni, Mauro .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2006, 21 (01) :53-94
[5]  
Dvijotham Krishnamurthy, 2010, P 27 INT C MACH LEAR, P335, DOI DOI 10.0RG/PAPERS/571.PDF
[6]  
Finn C, 2016, PR MACH LEARN RES, V48
[7]  
Gardiner C W, 1985, HDB STOCHASTIC METHO, V4
[8]  
Goswami J.C., 2011, Fundamentals of wavelets: theory, algorithms, and applications, V233
[9]  
Ha J.S., 2018, P 32 INT C NEUR INF, P8941
[10]   Approximate Inference-Based Motion Planning by Learning and Exploiting Low-Dimensional Latent Variable Models [J].
Ha, Jung-Su ;
Chae, Hyeok-Joo ;
Choi, Han-Lim .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3892-3899