38 entries in total
[2] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38(04): 943-949
[5] An Introduction to Deep Reinforcement Learning [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11(3-4): 219-354
[6] Gu SX, 2016, PR MACH LEARN RES, V48
[7] Harmon M. E., 1995, Advances in Neural Information Processing Systems 7, P353