An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots

被引:0
作者
Rafiuddin Syam
Keigo Watanabe
Kiyotaka Izumi
机构
[1] Saga University,Department of Advanced Systems Control Engineering, Graduate School of Science and Engineering
来源
Soft Computing | 2007年 / 11卷
关键词
Actor-critic algorithms; Kinematic model; Multi-step prediction; Nonholonomic mobile robot; Nonlinear predictive model; Simulated experience;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a new algorithm of an adaptive actor-critic method with multi-step simulated experiences, as a kind of temporal difference (TD) method. In our approach, the TD-error is composed of two value- functions and m utility functions, where m denotes the number of multi-steps in which the experience should be simulated. The value-function is constructed from the critic formulated by a radial basis function neural network (RBFNN), which has a simulated experience as an input, generated from a predictive model based on a kinematic model. Thus, since our approach assumes that the model is available to simulate the m-step experiences and to design a controller, such a kinematic model is also applied to construct the actor and the resultant model based actor (MBA) is also regarded as a network, i.e., it is just viewed as a resolved velocity control network. We implement this approach to control nonholonomic mobile robot, especially in a trajectory tracking control problem for the position coordinates and azimuth. Some simulations show the effectiveness of the proposed method for controlling a mobile robot with two-independent driving wheels.
引用
收藏
页码:81 / 89
页数:8
相关论文
共 21 条
[1]  
van Buijtenen WM(1998)Adaptive fuzzy control of satellite attitude by reinforcement learning IEEE Trans Fuzzy Syst 6 185-194
[2]  
Schram G(1988)Learning to predict by the methods of temporal differences Mach Learn 3 9-44
[3]  
Babuška R(2003)On actor-critic algorithms SIAM J Contr Optim 42 1143-1166
[4]  
Verbruggen HB(1997)Adaptive critic designs IEEE Trans Neural Network 8 997-1007
[5]  
Sutton RS(1983)Neuron-like adaptive elements can solve difficult learning control problems IEEE Trans Syst Man, Cybernet 13 834-846
[6]  
Konda VR(1996)A fuzzy-Gaussian neural network and its application to mobile robot IEEE Trans Contr Syst Technol 4 193-199
[7]  
Tsitsiklis JN(1998)Control of nonholonomic mobile robot using neural networks IEEE Trans Neural Network 9 589-600
[8]  
Prokhorov DV(1997)Control of a nonholonomic mobile robot: backstepping kinematics into dynamics J Robot Syst 14 149-163
[9]  
Wunch DC(undefined)undefined undefined undefined undefined-undefined
[10]  
Barto AG(undefined)undefined undefined undefined undefined-undefined