共 32 条
- [3] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
- [6] VARIANCE-PENALIZED MARKOV DECISION-PROCESSES [J]. MATHEMATICS OF OPERATIONS RESEARCH, 1989, 14 (01) : 147 - 161
- [8] Guo XP, 2009, STOCH MOD APPL PROBA, V62, P1, DOI 10.1007/978-3-642-02547-1_1