Existence of Markov controls and characterization of optimal Markov controls

被引:100
作者
Kurtz, TG [1 ]
Stockbridge, RH
机构
[1] Univ Wisconsin, Dept Math, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[3] Univ Kentucky, Dept Stat, Lexington, KY 40506 USA
关键词
Markov controls; optimal controls; martingale problems; stationary processes; linear programming; occupation measures;
D O I
10.1137/S0363012995295516
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a solution of a controlled martingale problem it is shown under general conditions that there exists a solution having Markov controls which has the same cost as the original solution. This result is then used to show that the original stochastic control problem is equivalent to a linear program over a space of measures under a variety of optimality criteria. Existence and characterization of optimal Markov controls then follows. An extension of Echeverria's theorem characterizing stationary distributions for (uncontrolled) Markov processes is obtained as a corollary. In particular, this extension covers diffusion processes with discontinuous drift and diffusion coefficients.
引用
收藏
页码:609 / 653
页数:45
相关论文
共 20 条
[1]  
[Anonymous], 1985, TRANSL SER MATH ENGR
[2]  
Bhati AG, 1996, ANN PROBAB, V24, P1531
[3]   INVARIANT-MEASURES AND EVOLUTION-EQUATIONS FOR MARKOV-PROCESSES CHARACTERIZED VIA MARTINGALE PROBLEMS [J].
BHATT, AG ;
KARANDIKAR, RL .
ANNALS OF PROBABILITY, 1993, 21 (04) :2246-2268
[4]  
Borkar V. S., 1986, Stochastics, V18, P17, DOI 10.1080/17442508608833398
[5]   ERGODIC CONTROL OF MULTIDIMENSIONAL DIFFUSIONS .1. THE EXISTENCE RESULTS [J].
BORKAR, VS ;
GHOSH, MK .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1988, 26 (01) :112-126
[6]   LINEAR PROGRAMMING IN A MARKOV DECISION PROBLEM [J].
DENARDO, EV .
MANAGEMENT SCIENCE SERIES A-THEORY, 1970, 16 (05) :281-288
[7]   ON SEQUENTIAL DECISIONS AND MARKOV-CHAINS [J].
DERMAN, C .
MANAGEMENT SCIENCE, 1962, 9 (01) :16-24
[8]  
Dvoretzky A., 1972, P 6 BERK S MATH STAT, P513
[9]  
El Karoui N., 1987, Stochastics, V20, P169, DOI 10.1080/17442508708833443
[10]  
ETHIER S., 1986, MARKOV PROCESSES CHA, DOI 10.1002/9780470316658