Approximation of Constrained Average Cost Markov Control Processes

被引:0
作者
Sutter, Tobias [1 ]
Esfahani, Peyman Mohajerin [1 ]
Lygeros, John [1 ]
机构
[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland
来源
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2014年
关键词
LINEAR-PROGRAMMING APPROACH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces a two-step method is presented to numerically approximate the optimal value of this constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP consisting of a first asymptotic step and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.
引用
收藏
页码:6597 / 6602
页数:6
相关论文
共 27 条
  • [1] [Anonymous], APPROXIMATE DYNAMIC
  • [2] [Anonymous], 1987, WILEY INTERSCIENCE S
  • [3] [Anonymous], INT SERIES OPERATION
  • [4] [Anonymous], 1996, PRINCETON MATH SER
  • [5] DISCRETE-TIME CONTROLLED MARKOV-PROCESSES WITH AVERAGE COST CRITERION - A SURVEY
    ARAPOSTATHIS, A
    BORKAR, VS
    FERNANDEZGAUCHERAND, E
    GHOSH, MK
    MARCUS, SI
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1993, 31 (02) : 282 - 344
  • [6] Barvinok A., 2002, GRADUATE STUDIES MAT
  • [7] BenTal A, 2009, PRINC SER APPL MATH, P1
  • [8] Bertsekas D, 2012, DYNAMIC PROGRAMMING, V1
  • [9] Dynamic programming and suboptimal control: A survey from ADP to MPC
    Bertsekas, DP
    [J]. EUROPEAN JOURNAL OF CONTROL, 2005, 11 (4-5) : 310 - 334
  • [10] Borkar V., 2002, INT SERIES OPERATION, V40