Markov decision processes with recursive risk measures

被引:18
作者
Baeuerle, Nicole [1 ]
Glauner, Alexander [1 ]
机构
[1] Karlsruhe Inst Technol KIT, Dept Math, D-76128 Karlsruhe, Germany
关键词
Dynamic programming; Risk-sensitive Markov decision process; Risk measure; Robustness; TIME CONSISTENCY; MODEL;
D O I
10.1016/j.ejor.2021.04.030
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we consider risk-sensitive Markov Decision Processes (MDPs) with Borel state and action spaces and unbounded cost. We treat both finite and infinite planning horizons. Our optimality criterion is based on the recursive application of static risk measures. This is motivated by recursive utilities in the economic literature. It has been studied before for the entropic risk measure and is extended here to general static risk measures. Under direct assumptions on the model data we derive a Bellman equa-tion and prove the existence of optimal Markov policies. For an infinite planning horizon, the model is shown to be contractive and the optimal policy to be stationary. Our approach unifies results for a num-ber of well-known risk measures. Moreover, we establish a connection to distributionally robust MDPs, which provides a global interpretation of the recursively defined objective function. Monotone models are studied in particular. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:953 / 966
页数:14
相关论文
共 42 条
  • [1] Acciaio B, 2011, ADVANCED MATHEMATICAL METHODS FOR FINANCE, P1, DOI 10.1007/978-3-642-18412-3_1
  • [2] Aliprantis C.D., 2006, Infinite Dimensional Analysis. A Hitchhikers Guide, V3
  • [3] Asienkiewicz H., 2017, Applicationes Mathematicae (Warsaw), V44, P149
  • [4] Optimal risk allocation in reinsurance networks
    Baeuerle, Nicole
    Glauner, Alexander
    [J]. INSURANCE MATHEMATICS & ECONOMICS, 2018, 82 : 37 - 47
  • [5] Stochastic optimal growth model with risk sensitive preferences
    Baeuerle, Nicole
    Jaskiewicz, Anna
    [J]. JOURNAL OF ECONOMIC THEORY, 2018, 173 : 181 - 200
  • [6] Optimal dividend payout model with risk sensitive preferences
    Baeuerle, Nicole
    Jaskiewicz, Anna
    [J]. INSURANCE MATHEMATICS & ECONOMICS, 2017, 73 : 82 - 93
  • [7] Markov Decision Processes with Average-Value-at-Risk criteria
    Baeuerle, Nicole
    Ott, Jonathan
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2011, 74 (03) : 361 - 379
  • [8] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
  • [9] Bauerle N., 2020, ARXIV200713103
  • [10] A Unified Approach to Time Consistency of Dynamic Risk Measures and Dynamic Performance Measures in Discrete Time
    Bielecki, Tomasz R.
    Cialenco, Igor
    Pitera, Marcin
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2018, 43 (01) : 204 - 221