33 items in total
[11]
Vamvoudakis K.G., Vrabie D., Lewis F.L., Online adaptive algorithm for optimal control with integral reinforcement learning, International Journal of Robust and Nonlinear Control
[12]
Mehraeen S., Jagannathan S., Decentralized nearly optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Bellman-Jacobi formulation, Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN), pp. 1-8, (2010)
[13]
Bhasin S., Sharma N., Patre P., Dixon W., Asymptotic tracking by a reinforcement learning-based adaptive critic controller, Journal of Control Theory and Applications, 9, 3, pp. 400-409, (2011)
[14]
Modares H., Sistani M.B.N., Lewis F.L., A policy iteration approach to online optimal control of continuous-time constrained-input systems, ISA Transactions, 52, 5, pp. 611-621, (2013)
[15]
Chen Z., Jagannathan S., Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete time systems, IEEE Transactions on Neural Networks, 19, 1, pp. 90-106, (2008)
[16]
Jiang Y., Jiang Z.P., Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics, Automatica, 48, 10, pp. 2699-2704, (2012)
[17]
Al-Tamimi A., Lewis F., Abu-Khalaf M., Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 38, 4, pp. 943-949, (2008)
[18]
Cheng T., Lewis F.L., Abu-Khalaf M., A neural network solution for fixed-final time optimal control of nonlinear systems, Automatica, 43, 3, pp. 482-490, (2007)
[19]
Kober J., Bagnell D., Peters J., Reinforcement learning in robotics: A survey, International Journal of Robotics Research, 32, 11, pp. 1236-1274, (2013)
[20]
Hasselt H., Reinforcement learning in continuous state and action spaces, Adaptation, Learning, and Optimization, pp. 207-251, (2012)