共 16 条
- [1] Benito F(1982)Calculating the variance in Markov processes with random reward Trabajos Estadistica Investigacion Operativa 33 73-85
- [2] Filar J(1989)Variance penalized Markov decision processes Math Oper Res 14 147-161
- [3] Kallenberg LCM(1994)On finding optimal policies for Markov decision chains: a unifying framework for mean-variance-tradeoffs Math Oper Res 19 434-448
- [4] Lee H-M(1972)Markov decision processes with a new optimality criterion: small interest rates Ann Math Stat 43 1894-1901
- [5] Huang Y(1973)Markov decision processes with a new optimality criterion: discrete time Ann Statist 1 496-505
- [6] Kallenberg LCM(1975)Markov decision processes with a new optimality criterion: continuous time Ann Stat 3 547-553
- [7] Jaquette SC(1997)A minimum average-variance in Markov decision processes Bull Inform Cybern 29 83-89
- [8] Jaquette SC(1987)A variance minimization problem for a Markov decision process Eur J Oper Res 31 140-145
- [9] Jaquette SC(1987)Markov decision processes with a minimum-variance criterion J Math Anal Appl 123 572-583
- [10] Kadota Y(1971)On the variance in controlled Markov chains Kybernetika 7 1-12