An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots

被引：0

作者：

Rafiuddin Syam

Keigo Watanabe

Kiyotaka Izumi

机构：

[1] Saga University,Department of Advanced Systems Control Engineering, Graduate School of Science and Engineering

来源：

Soft Computing | 2007年 / 11卷

关键词：

Actor-critic algorithms; Kinematic model; Multi-step prediction; Nonholonomic mobile robot; Nonlinear predictive model; Simulated experience;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we propose a new algorithm of an adaptive actor-critic method with multi-step simulated experiences, as a kind of temporal difference (TD) method. In our approach, the TD-error is composed of two value- functions and m utility functions, where m denotes the number of multi-steps in which the experience should be simulated. The value-function is constructed from the critic formulated by a radial basis function neural network (RBFNN), which has a simulated experience as an input, generated from a predictive model based on a kinematic model. Thus, since our approach assumes that the model is available to simulate the m-step experiences and to design a controller, such a kinematic model is also applied to construct the actor and the resultant model based actor (MBA) is also regarded as a network, i.e., it is just viewed as a resolved velocity control network. We implement this approach to control nonholonomic mobile robot, especially in a trajectory tracking control problem for the position coordinates and azimuth. Some simulations show the effectiveness of the proposed method for controlling a mobile robot with two-independent driving wheels.

引用

页码：81 / 89

页数：8

共 21 条

[1]

van Buijtenen WM(1998)Adaptive fuzzy control of satellite attitude by reinforcement learning IEEE Trans Fuzzy Syst 6 185-194

[2]

Schram G(1988)Learning to predict by the methods of temporal differences Mach Learn 3 9-44

[3]

Babuška R(2003)On actor-critic algorithms SIAM J Contr Optim 42 1143-1166

[4]

Verbruggen HB(1997)Adaptive critic designs IEEE Trans Neural Network 8 997-1007

[5]

Sutton RS(1983)Neuron-like adaptive elements can solve difficult learning control problems IEEE Trans Syst Man, Cybernet 13 834-846

[6]

Konda VR(1996)A fuzzy-Gaussian neural network and its application to mobile robot IEEE Trans Contr Syst Technol 4 193-199

[7]

Tsitsiklis JN(1998)Control of nonholonomic mobile robot using neural networks IEEE Trans Neural Network 9 589-600

[8]

Prokhorov DV(1997)Control of a nonholonomic mobile robot: backstepping kinematics into dynamics J Robot Syst 14 149-163

[9]

Wunch DC(undefined)undefined undefined undefined undefined-undefined

[10]

Barto AG(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 →