41 entries in total
- [1] Abdulla Mohammed Shahid, 2007, 2007 American Control Conference, P534, DOI 10.1109/ACC.2007.4282587
- [2] [Anonymous], 2004, THESIS
- [3] [Anonymous], 1992, HDB INTELLIGENT CONT
- [4] [Anonymous], 2010, Algorithms for Reinforcement Learning
- [5] Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics