A characterization of the optimal risk-sensitive average cost in finite controlled Markov chains

被引:29
作者
Cavazos-Cadena, R [1 ]
Hernández-Hernández, D
机构
[1] Univ Autonoma Agraria Antonio Narro, Dept Estadist & Calculo, Saltillo 25315, Coahuila, Mexico
[2] Ctr Invest Matemat, Guanajuato 36000, GTO, Mexico
关键词
decreasing function along trajectories; stopping time; nearly optimal policies; Holder's inequality; simultaneous Doeblin condition; recurrent state;
D O I
10.1214/105051604000000585
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This work concerns controlled Markov chains with finite state and action spaces. The transition law satisfies the simultaneous Doeblin condition, and the performance of a control policy is measured by the (long-run) risk-sensitive average cost criterion associated to a positive, but otherwise arbitrary, risk sensitivity coefficient. Within this context, the optimal risk-sensitive average cost is characterized via a minimization problem in a finite-dimensional Euclidean space.
引用
收藏
页码:175 / 212
页数:38
相关论文
共 21 条
[1]   Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management [J].
Bielecki, T ;
Hernández-Hernández, D ;
Pliska, SR .
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1999, 50 (02) :167-188
[2]   Risk-sensitive dynamic asset management [J].
Bielecki, TR ;
Pliska, SR .
APPLIED MATHEMATICS AND OPTIMIZATION, 1999, 39 (03) :337-360
[3]  
Cavazos-Cadena R, 1999, MATH METHOD OPER RES, V49, P299
[4]   Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space:: An alternative approach [J].
Cavazos-Cadena, R ;
Hernández-Hernández, D .
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 56 (03) :473-479
[5]  
CAVAZOSCADENA R, 2001, MODELING UNCERTAINTY, P515
[6]  
CAVAZOSCADENA R, 2004, IN PRESS MATH METHOD
[7]  
Dembo A., 1993, Large deviations techniques and applications
[8]   Risk-sensitive control of discrete-time Markov processes with infinite horizon [J].
Di Masi, GB ;
Stettner, L .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) :61-78
[9]   ASYMPTOTIC EVALUATION OF CERTAIN MARKOV PROCESS EXPECTATIONS FOR LARGE TIME, I [J].
DONSKER, MD ;
VARADHAN, SRS .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1975, 28 (01) :1-47
[10]   ASYMPTOTIC EVALUATION OF CERTAIN MARKOV PROCESS EXPECTATIONS FOR LARGE TIME .3. [J].
DONSKER, MD ;
VARADHAN, SRS .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1976, 29 (04) :389-461