Optimal control with adaptive internal dynamics models

被引:0
作者
Mitrovic, Djordje [1 ]
Klanke, Stefan [1 ]
Vijayakumar, Sethu [1 ]
机构
[1] Univ Edinburgh, Inst Percept Act & Behav, Sch Informat, Edinburgh, Midlothian, Scotland
来源
ICINCO 2008: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL ICSO: INTELLIGENT CONTROL SYSTEMS AND OPTIMIZATION | 2008年
关键词
learning dynamics; optimal control; adaptive control; robot simulation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimal feedback control has been proposed as an attractive movement generation strategy in goal reaching tasks for anthropomorphic manipulator systems. The optimal feedback control law for systems with non-linear dynamics and non-quadratic costs can be found by iterative methods, such as the iterative Linear Quadratic Gaussian (iLQG) algorithm. So far this framework relied on an analytic form of the system dynamics, which may often be unknown, difficult to estimate for more realistic control systems or may be subject to frequent systematic changes. In this paper, we present a novel combination of learning a forward dynamics model within the iLQG framework. Utilising such adaptive internal models can compensate for complex dynamic perturbations of the controlled system in an online fashion. The specific adaptive framework introduced lends itself to a computationally more efficient implementation of the iLQG optimisation without sacrificing control accuracy-allowing the method to scale to large DoF systems.
引用
收藏
页码:141 / 148
页数:8
相关论文
共 21 条
[1]  
Agarwal Shivani, 2006, P 23 INT C MACH LEAR
[2]  
Atkeson CG, 1997, ARTIF INTELL REV, V11, P75, DOI 10.1023/A:1006511328852
[3]  
Atkeson CG, 1997, IEEE INT CONF ROBOT, P1706, DOI 10.1109/ROBOT.1997.614389
[4]   Randomly sampling actions in dynamic programming [J].
Atkeson, Christopher G. .
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, :185-192
[5]  
Bertsekas D.P., 2005, DYNAMIC PROGRAMMING, V1
[6]   THE COORDINATION OF ARM MOVEMENTS - AN EXPERIMENTALLY CONFIRMED MATHEMATICAL-MODEL [J].
FLASH, T ;
HOGAN, N .
JOURNAL OF NEUROSCIENCE, 1985, 5 (07) :1688-1703
[7]   Signal-dependent noise determines motor planning [J].
Harris, CM ;
Wolpert, DM .
NATURE, 1998, 394 (6695) :780-784
[8]  
Jacobson D. H., 1970, Differential Dynamic Programming. American
[9]   Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system [J].
Li, W. ;
Todorov, E. .
INTERNATIONAL JOURNAL OF CONTROL, 2007, 80 (09) :1439-1453
[10]  
Li W, 2004, Proceedings of the 1st International Conference on Informatics in Control, Automation and Robotics, P222, DOI DOI 10.5220/0001143902220229