共 12 条
[1]
Altman E., Shwartz A., Sensitivity of constrained Markov decision processes, (1990)
[2]
Beutler F.J, Ross K.W, Optimal policies for controlled Markov chains with a constraint, J. Math. Anal. Appl., 112, pp. 236-252, (1985)
[3]
Billingsley P., Convergence of probability measures, (1968)
[4]
Borkar V.S, Control of Markov chains with long-run average cost criterion: the dynamic programmin equations, SIAM J. Control Optim., 27, pp. 642-657, (1989)
[5]
Borkar V.S, Topics in controlled Markov chains, Pitman research notes in mathematics, (1991)
[6]
Dubins L., On extreme points of convex sets, J. Math. Anal. Appl., 5, pp. 237-244, (1962)
[7]
Hordijk A., Kallenberg L.C.M, Constrained undiscounted stochastic dynamic programming, Math. Oper. Res., 9, pp. 276-289, (1984)
[8]
Luenberger D., Optimization by vector space methods, (1967)
[9]
Phelps R., Lectures on Choquet’s theorem, (1966)
[10]
Ross K.W, Randomized and past-dependent policies for Markov decision processes with multiple constraints, Oper. Res., 37, pp. 474-477, (1989)