[1]
Widrow B., Gupta N.K., Maitra S., Punish/reward: learning with a critic in adaptive threshold systems, IEEE Transactions on Systems, Man, and Cybernetics, 3, 5, pp. 455-465, (1973)
[2]
Barto A.G., Sutton R.S., Anderson C.W., Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Transactions on Systems, Man, and Cybernetics, 13, 5, pp. 835-846, (1983)
[3]
Werbos P.J., Approximate dynamic programming for real-time control and neural modeling, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, (1992)
[4]
Bertsekas D.P., Tsitsiklis J.N., Neuro-Dynamic Programming, (1996)
[5]
Prokhorov D.V., Wunsch D.C., Adaptive critic designs, IEEE Transactions on Neural Networks, 8, 5, pp. 997-1007, (1997)
[6]
Si J., Wang Y.T., Online learning control by association and reinforcement, IEEE Transactions on Neural Networks, 12, 2, pp. 264-276, (2001)
[7]
Liu D.R., Xiong X.X., Zhang Y., Action-dependent adaptive critic designs, Proceedings of the International Joint Conference on Neural Networks, pp. 990-995, (2001)
[8]
Murray J.J., Cox C.J., Lendaris G.G., Saeks R., Adaptive dynamic programming, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 32, 2, pp. 140-153, (2002)
[9]
Abu-Khalaf M., Lewis F.L., Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach, Automatica, 41, 5, pp. 779-791, (2005)
[10]
Liu D.-R., Approximate dynamic programming for self-learning control, Acta Automatica Sinica, 31, 1, pp. 13-18, (2005)