共 41 条
[1]
Abdulla Mohammed Shahid, 2007, 2007 American Control Conference, P534, DOI 10.1109/ACC.2007.4282587
[2]
[Anonymous], 2004, THESIS
[3]
[Anonymous], 1992, HDB INTELLIGENT CONT
[4]
[Anonymous], 2010, Algorithms for Reinforcement Learning
[5]
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics