共 27 条
- [1] Bellman R., On the theory of dynamic programming, Proc Natl Acad Sci USA, 38, 8, (1952)
- [2] Bellman R., The theory of dynamic programming, Bull Amer Math Soc, 60, 6, pp. 503-515, (1954)
- [3] Bellman R., Dynamic programming, Science, 153, 3731, pp. 34-37, (1966)
- [4] Sutton R.S., Barto A.G., Reinforcement learning: an introduction, (1998)
- [5] Powell W.B., Approximate dynamic programming: solving the curses of dimensionality, 703, (2007)
- [6] Lewis F.L., Liu D., Reinforcement learning and approximate dynamic programming for feedback control, 17, (2013)
- [7] Novoa C., Storer R., An approximate dynamic programming approach for the vehicle routing problem with stochastic demands, Eur J Oper Res, 196, 2, pp. 509-515, (2009)
- [8] Al-Tamimi A., Lewis F.L., Abu-Khalaf M., Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof, IEEE Trans Syst Man Cyber Part B (Cyber), 38, 4, pp. 943-949, (2008)
- [9] Wei Q., Liu D., Lin H., Value iteration adaptive dynamic programming for optimal control of discrete-time nonlinear systems, IEEE Trans Cybern, 46, 3, pp. 840-853, (2016)
- [10] Wu H.N., Luo B., Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic pde systems, Ind Eng Chem Res, 51, 27, pp. 9310-9319, (2012)