共 54 条
[4]
GUO L, 2005, INTRO CONTROL THEORY
[5]
Hardy G. H., 1952, Inequalities
[10]
Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2011, 41 (01)
:14-25