Approximation of Constrained Average Cost Markov Control Processes

被引：0

作者：

Sutter, Tobias ^{[1
]}

Esfahani, Peyman Mohajerin ^{[1
]}

Lygeros, John ^{[1
]}

机构：

[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland

来源：

2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2014年

关键词：

LINEAR-PROGRAMMING APPROACH;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces a two-step method is presented to numerically approximate the optimal value of this constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP consisting of a first asymptotic step and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.

引用

页码：6597 / 6602

页数：6

共 27 条

[1] [Anonymous], APPROXIMATE DYNAMIC
[2] [Anonymous], 1987, WILEY INTERSCIENCE S
[3] [Anonymous], INT SERIES OPERATION
[4] [Anonymous], 1996, PRINCETON MATH SER
[5] DISCRETE-TIME CONTROLLED MARKOV-PROCESSES WITH AVERAGE COST CRITERION - A SURVEY
ARAPOSTATHIS, A
BORKAR, VS
FERNANDEZGAUCHERAND, E
GHOSH, MK
MARCUS, SI
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1993, 31 (02) : 282 - 344
[6] Barvinok A., 2002, GRADUATE STUDIES MAT
[7] BenTal A, 2009, PRINC SER APPL MATH, P1
[8] Bertsekas D, 2012, DYNAMIC PROGRAMMING, V1
[9] Dynamic programming and suboptimal control: A survey from ADP to MPC
Bertsekas, DP
[J]. EUROPEAN JOURNAL OF CONTROL, 2005, 11 (4-5) : 310 - 334
[10] Borkar V., 2002, INT SERIES OPERATION, V40

← 1 2 3 →