共 19 条
[4]
BERTSEKAS D. P, 1978, Neuro-dynamic programming
[5]
Bertsekas DP, 1995, Dynamic programming and optimal control, V1
[6]
Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2
[7]
Cassandra A. R., 1998, Exact and approximate algorithms for partially observable Markov decision processes
[8]
Elliott R., 1995, Hidden Markov Models-Estimation and Control, V29
[9]
Hernandez-Lerma O., 1996, Discrete-Time Markov Control Processes: Basic Optimality Criteria
[10]
KUMAR P. R., 2015, Stochastic Systems: Estimation, Identification, and Adaptive Control