A diffusion wavelets-based multiscale framework for inverse optimal control of stochastic systems

被引：0

作者：

Ha, Jung-Su ^{[1
,2
]}

Chae, Hyeok-Joo ^{[3
,4
]}

Choi, Han-Lim ^{[3
,4
]}

机构：

[1] Tech Univ Berlin, Learning & Intelligent Syst Lab, Berlin, Germany

[2] Max Planck Inst Intelligent Syst, Stuttgart, Germany

[3] Korea Adv Inst Sci & Technol, Dept Aerosp Engn, 291 Daehak Ro, Daejeon, South Korea

[4] Korea Adv Inst Sci & Technol, KI Robot, 291 Daehak Ro, Daejeon, South Korea

来源：

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE | 2021年 / 52卷 / 11期

基金：

新加坡国家研究基金会;

关键词：

Inverse optimal control; diffusion wavelets; multiresolution analysis;

D O I：

10.1080/00207721.2021.1882011

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work presents a multiscale framework to solve a class of inverse optimal control (IOC) problems in the context of robot motion planning and control in a complex environment. In order to handle complications resulting from a large decision space and complex environmental geometry, two key concepts are adopted: (a) a diffusion wavelet representation of the Markov chain for hierarchical abstraction of the state space; and (b) a desirability function-based representation of the Markov decision process (MDP) to efficiently calculate the optimal policy. An IOC problem constructed on a 'abstract state' is solved, which is much more tractable than using the original bases set; moreover, the solution can be obtained recursively in the 'coarse to fine' direction by utilizing the hierarchical structure of basis functions. The resulting multiscale plan is utilized to finally compute a continuous-time optimal control policy within a receding horizon implementation. Illustrative numerical experiments on a robot path control in a complex environment and on a quadrotor ball-catching task are presented to verify the proposed method.

引用

页码：2228 / 2240

页数：13

共 28 条

[1]

[Anonymous], 2004, Introduction to machine learning

[2]

[Anonymous], 1986, Introduction to applied mathematics

[3]

Choi H.L., 2017, IEEE INT C ROB AUT I, DOI [10.1109/icra.2017.7989085, DOI 10.1109/ICRA.2017.7989085]

[4] Diffusion wavelets [J].

Coifman, Ronald R. ;

Maggioni, Mauro .

APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2006, 21 (01) :53-94

[5]

Dvijotham Krishnamurthy, 2010, P 27 INT C MACH LEAR, P335, DOI DOI 10.0RG/PAPERS/571.PDF

[6]

Finn C, 2016, PR MACH LEARN RES, V48

[7]

Gardiner C W, 1985, HDB STOCHASTIC METHO, V4

[8]

Goswami J.C., 2011, Fundamentals of wavelets: theory, algorithms, and applications, V233

[9]

Ha J.S., 2018, P 32 INT C NEUR INF, P8941

[10] Approximate Inference-Based Motion Planning by Learning and Exploiting Low-Dimensional Latent Variable Models [J].

Ha, Jung-Su ;

Chae, Hyeok-Joo ;

Choi, Han-Lim .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3892-3899

← 1 2 3 →