21 entries in total
- [13] Linear programming and Markov decision chains [J]. Management Science, 1979, 25(4): 352-362
- [14] Kumar P.R., 1986, ESTIMATION IDENTIFIC
- [15] Puterman M.L., 2008, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Statistics
- [17] On non-stationary policies and maximal invariant safe sets of controlled Markov chains [J]. 2004 43rd IEEE Conference on Decision and Control (CDC), Vols 1-5, 2004: 3696-3701