Constrained discounted Markov decision processes and Hamiltonian cycles

被引:0
|
作者
Feinberg, EA [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper establishes new links between stochastic and discrete optimization. We consider the following three problems for discrete time Markov Decision Processes with finite states and action sets: (i) find an optimal deterministic policy for a discounted problem with constraints, (ii) find an optimal stationary policy for a weighted discounted problem with constraints, (iii) find an optimal deterministic policy for a weighted discounted problem with constraints. We show that the Hamiltonian Cycle Problem is a special case of each of these problems. Therefore problems (i - iii) are NP-hard in spite of the fact that a minor modification of problem (i) is polynomially solvable. We construct mathematical programs for each of these problems and formulate algorithms for the Hamiltonian Cycle and Traveling Salesman Problems that follow from the algorithms for problems (i - iii).
引用
收藏
页码:2821 / 2826
页数:6
相关论文
共 50 条
  • [21] CONSTRAINED MARKOV DECISION-MODELS WITH WEIGHTED DISCOUNTED REWARDS
    FEINBERG, EA
    SHWARTZ, A
    MATHEMATICS OF OPERATIONS RESEARCH, 1995, 20 (02) : 302 - 320
  • [22] Hierarchical algorithms for discounted and weighted Markov decision processes
    Abbad, M
    Daoui, C
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
  • [23] Discounted Markov Decision Processes for Small Noise Intensities
    Cruz-Suarez, Hugo
    Ilhuicatzi-Roldan, Rocio
    RECENT ADVANCES IN APPLIED MATHEMATICS, 2009, : 245 - +
  • [24] A Version of the Euler Equation in Discounted Markov Decision Processes
    Cruz-Suarez, H.
    Zacarias-Espinoza, G.
    Vazquez-Guevara, V.
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [25] A note on deterministic approximation of discounted Markov decision processes
    Cruz-Suarez, Hugo
    Gordienko, Evgueni
    Montes-de-Oca, Raul
    APPLIED MATHEMATICS LETTERS, 2009, 22 (08) : 1252 - 1256
  • [26] Discounted Markov Decision Processes via Time Aggregation
    Arruda, Edilson F.
    Fragoso, Marcelo D.
    2016 EUROPEAN CONTROL CONFERENCE (ECC), 2016, : 2578 - 2583
  • [27] Hierarchical algorithms for discounted and weighted Markov decision processes
    M. Abbad
    C. Daoui
    Mathematical Methods of Operations Research, 2003, 58 : 237 - 245
  • [28] Constrained Markov control processes in Borel spaces: the discounted case
    Onésimo Hernández-Lerma
    Juan González-Hernández
    Mathematical Methods of Operations Research, 2000, 52 : 271 - 285
  • [29] Limits of multi-discounted Markov decision processes
    Gimbert, Hugo
    Zielonka, Wieslaw
    22ND ANNUAL IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, PROCEEDINGS, 2007, : 89 - +
  • [30] Constrained Markov control processes in Borel spaces:: the discounted case
    Hernández-Lerma, O
    González-Hernández, J
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2000, 52 (02) : 271 - 285