Markov decision processes with recursive risk measures

被引：18

作者：

Baeuerle, Nicole ^{[1
]}

Glauner, Alexander ^{[1
]}

机构：

[1] Karlsruhe Inst Technol KIT, Dept Math, D-76128 Karlsruhe, Germany

来源：

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH | 2022年 / 296卷 / 03期

关键词：

Dynamic programming; Risk-sensitive Markov decision process; Risk measure; Robustness; TIME CONSISTENCY; MODEL;

D O I：

10.1016/j.ejor.2021.04.030

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

In this paper, we consider risk-sensitive Markov Decision Processes (MDPs) with Borel state and action spaces and unbounded cost. We treat both finite and infinite planning horizons. Our optimality criterion is based on the recursive application of static risk measures. This is motivated by recursive utilities in the economic literature. It has been studied before for the entropic risk measure and is extended here to general static risk measures. Under direct assumptions on the model data we derive a Bellman equa-tion and prove the existence of optimal Markov policies. For an infinite planning horizon, the model is shown to be contractive and the optimal policy to be stationary. Our approach unifies results for a num-ber of well-known risk measures. Moreover, we establish a connection to distributionally robust MDPs, which provides a global interpretation of the recursively defined objective function. Monotone models are studied in particular. (c) 2021 Elsevier B.V. All rights reserved.

引用

页码：953 / 966

页数：14

共 42 条

[1] Acciaio B, 2011, ADVANCED MATHEMATICAL METHODS FOR FINANCE, P1, DOI 10.1007/978-3-642-18412-3_1
[2] Aliprantis C.D., 2006, Infinite Dimensional Analysis. A Hitchhikers Guide, V3
[3] Asienkiewicz H., 2017, Applicationes Mathematicae (Warsaw), V44, P149
[4] Optimal risk allocation in reinsurance networks
Baeuerle, Nicole
Glauner, Alexander
[J]. INSURANCE MATHEMATICS & ECONOMICS, 2018, 82 : 37 - 47
[5] Stochastic optimal growth model with risk sensitive preferences
Baeuerle, Nicole
Jaskiewicz, Anna
[J]. JOURNAL OF ECONOMIC THEORY, 2018, 173 : 181 - 200
[6] Optimal dividend payout model with risk sensitive preferences
Baeuerle, Nicole
Jaskiewicz, Anna
[J]. INSURANCE MATHEMATICS & ECONOMICS, 2017, 73 : 82 - 93
[7] Markov Decision Processes with Average-Value-at-Risk criteria
Baeuerle, Nicole
Ott, Jonathan
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2011, 74 (03) : 361 - 379
[8] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
[9] Bauerle N., 2020, ARXIV200713103
[10] A Unified Approach to Time Consistency of Dynamic Risk Measures and Dynamic Performance Measures in Discrete Time
Bielecki, Tomasz R.
Cialenco, Igor
Pitera, Marcin
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2018, 43 (01) : 204 - 221

← 1 2 3 4 5 →