Constrained Markovian decision processes: the dynamic programming approach

被引:27
|
作者
Piunovskiy, AB
Mao, X
机构
[1] Univ Liverpool, Dept Math Sci, Div Stat & Operat Res, Liverpool L69 7ZL, Merseyside, England
[2] Univ Strathclyde, Glasgow, Lanark, Scotland
关键词
Markovian decision processes; constrained optimization; dynamic programming; penalty functions;
D O I
10.1016/S0167-6377(00)00039-0
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider semicontinuous controlled Markov models in discrete time with total expected losses. Only control strategies which meet a set of given constraint inequalities are admissible. One has to build an optimal admissible strategy. The main result consists in the constructive development of optimal strategy with the help of the dynamic programming method. The model studied covers the case of a finite horizon and the case of a homogeneous discounted model with different discount factors. (C) 2000 Elsevier Science B,V. All rights reserved.
引用
收藏
页码:119 / 126
页数:8
相关论文
共 50 条
  • [1] Dynamic programming in constrained Markov decision processes
    Piunovskiy, A. B.
    CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
  • [2] A multi-parametric programming approach for constrained dynamic programming problems
    Faisca, Nuno P.
    Kouramas, Konstantinos I.
    Saraiva, Pedro M.
    Rustem, Berc
    Pistikopoulos, Efstratios N.
    OPTIMIZATION LETTERS, 2008, 2 (02) : 267 - 280
  • [3] A multi-parametric programming approach for constrained dynamic programming problems
    Nuno P. Faísca
    Konstantinos I. Kouramas
    Pedro M. Saraiva
    Berç Rustem
    Efstratios N. Pistikopoulos
    Optimization Letters, 2008, 2 : 267 - 280
  • [4] Dynamic Programming approach for constrained Model Predictive Control
    Soufian, M
    Sandoz, DJ
    Soufian, M
    SYSTEM STRUCTURE AND CONTROL 1997, 1998, : 219 - 224
  • [5] Dynamic programming approach to optimization of approximate decision rules
    Amin, Talha
    Chikalov, Igor
    Moshkov, Mikhail
    Zielosko, Beata
    INFORMATION SCIENCES, 2013, 221 : 403 - 418
  • [6] Dynamic Programming Approach for Partial Decision Rule Optimization
    Amin, Talha
    Chikalov, Igor
    Moshkov, Mikhail
    Zielosko, Beata
    FUNDAMENTA INFORMATICAE, 2012, 119 (3-4) : 233 - 248
  • [7] Sleeping experts and bandits approach to constrained Markov decision processes
    Chang, Hyeong Soo
    AUTOMATICA, 2016, 63 : 182 - 186
  • [8] Constrained discounted dynamic programming
    Feinberg, EA
    Shwartz, A
    MATHEMATICS OF OPERATIONS RESEARCH, 1996, 21 (04) : 922 - 945
  • [9] Time aggregated Markov decision processes via standard dynamic programming
    Arruda, Edilson F.
    Fragoso, Marcelo D.
    OPERATIONS RESEARCH LETTERS, 2011, 39 (03) : 193 - 197
  • [10] On constrained Markov decision processes
    Haviv, M
    OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28