21 entries in total
[1]
van Buijtenen WM, Schram G, Babuška R, Verbruggen HB (1998) Adaptive fuzzy control of satellite attitude by reinforcement learning. IEEE Trans Fuzzy Syst 6:185-194
[2]
Sutton RS (1988) Learning to predict by the methods of temporal differences. Mach Learn 3:9-44
[3]
Konda VR, Tsitsiklis JN (2003) On actor-critic algorithms. SIAM J Contr Optim 42:1143-1166
[4]
Prokhorov DV, Wunsch DC (1997) Adaptive critic designs. IEEE Trans Neural Network 8:997-1007
[5]
Barto AG, Sutton RS, Anderson CW (1983) Neuron-like adaptive elements can solve difficult learning control problems. IEEE Trans Syst Man Cybernet 13:834-846
[6]
Watanabe K, Tang J, Nakamura M, Koga S, Fukuda T (1996) A fuzzy-Gaussian neural network and its application to mobile robot. IEEE Trans Contr Syst Technol 4:193-199
[7]
Fierro R, Lewis FL (1998) Control of nonholonomic mobile robot using neural networks. IEEE Trans Neural Network 9:589-600
[8]
Fierro R, Lewis FL (1997) Control of a nonholonomic mobile robot: backstepping kinematics into dynamics. J Robot Syst 14:149-163
[9]
Wunch DC
[10]
Barto AG