54 entries in total
- [4] Guo L., 2005, INTRO CONTROL THEORY
- [5] Hardy G.H., 1952, INEQUALITIES
- [10] Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41(1): 14-25