Asymptotic optimization of a nonlinear hybrid system governed by a Markov decision process

被引:17
作者
Altman, E [1 ]
Gaitsgory, V [1 ]
机构
[1] UNIV S AUSTRALIA,SCH MATH,THE LEVELS,SA 5095,AUSTRALIA
关键词
hybrid stochastic systems; asymptotic optimality; nonlinear dynamics; Markov decision processes; averaging;
D O I
10.1137/S0363012995279985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider in this paper a continuous time stochastic hybrid control system with finite time horizon. The objective is to minimize a nonlinear function of the state trajectory. The state evolves according to a nonlinear dynamics. The parameters of the dynamics of the system may change at discrete times l epsilon, l = 0, 1, ..., according to a controlled Markov chain which has finite state and action spaces. Under the assumption that epsilon is a small parameter, we justify an averaging procedure allowing us to establish that our problem can be approximated by the solution of some deterministic optimal control problem.
引用
收藏
页码:2070 / 2085
页数:16
相关论文
共 25 条
[1]   PERTURBATION AND STABILITY THEORY FOR MARKOV CONTROL-PROBLEMS [J].
ABBAD, M ;
FILAR, JA .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1992, 37 (09) :1415-1420
[2]   RATE OF CONVERGENCE OF EMPIRICAL MEASURES AND COSTS IN CONTROLLED MARKOV-CHAINS AND TRANSIENT OPTIMALITY [J].
ALTMAN, E ;
ZEITOUNI, O .
MATHEMATICS OF OPERATIONS RESEARCH, 1994, 19 (04) :955-974
[3]   CONTROL OF A HYBRID STOCHASTIC-SYSTEM [J].
ALTMAN, E ;
GAITSGORY, V .
SYSTEMS & CONTROL LETTERS, 1993, 20 (04) :307-314
[4]   ADAPTIVE CONTROL OF CONSTRAINED MARKOV CHAINS: CRITERIA AND POLICIES [J].
Altman, Eitan ;
Shwartz, Adam .
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) :101-134
[5]  
[Anonymous], PERTURBATION METHODS
[6]  
[Anonymous], MATH CTR TRACTS
[7]  
BENSOUSSAN A, 1987, LECT NOTES CONTR INF, V90, P171
[8]  
BIELECKI D, 1991, ANN OPER RES, V28, P153
[9]  
Clarke F. H., 1983, OPTIMIZATION NONSMOO
[10]   CONTRIBUTION OF STOCHASTIC CONTROL SINGULAR PERTURBATION AVERAGING AND TEAM THEORIES TO AN EXAMPLE OF LARGE-SCALE SYSTEMS - MANAGEMENT OF HYDROPOWER PRODUCTION [J].
DELEBECQUE, F ;
QUADRAT, JP .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1978, 23 (02) :209-222