Markov decision processes with risk-sensitive criteria: an overview

被引:2
作者
Baeuerle, Nicole [1 ]
Jaskiewicz, Anna [2 ]
机构
[1] Karlsruhe Inst Technol KIT, Dept Math, D-76131 Karlsruhe, Germany
[2] Wroclaw Univ Sci & Technol, Fac Pure & Appl Math, Wroclaw, Poland
关键词
Markov decision process; Risk-sensitive decision; Optimized certainty equivalent; Optimal policy; MULTISTAGE STOCHASTIC PROGRAMS; VALUE-AT-RISK; INFINITE-HORIZON RISK; OPTIMAL-GROWTH MODEL; DISCRETE-TIME; AVERAGE OPTIMALITY; PRECAUTIONARY SAVINGS; STATIONARY POLICIES; VALUE-ITERATION; AVERSE CONTROL;
D O I
10.1007/s00186-024-00857-0
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The paper provides an overview of the theory and applications of risk-sensitive Markov decision processes. The term 'risk-sensitive' refers here to the use of the Optimized Certainty Equivalent as a means to measure expectation and risk. This comprises the well-known entropic risk measure and Conditional Value-at-Risk. We restrict our considerations to stationary problems with an infinite time horizon. Conditions are given under which optimal policies exist and solution procedures are explained. We present both the theory when the Optimized Certainty Equivalent is applied recursively as well as the case where it is applied to the cumulated reward. Discounted as well as non-discounted models are reviewed.
引用
收藏
页码:141 / 178
页数:38
相关论文
共 174 条
  • [1] Coherent risk measures in inventory problems
    Ahmed, Shabbir
    Cakmak, Ulas
    Shapiro, Alexander
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 182 (01) : 226 - 238
  • [2] A VARIATIONAL FORMULA FOR RISK-SENSITIVE REWARD
    Anantharam, V.
    Borkar, V. S.
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (02) : 961 - 988
  • [3] The dynamics of risk-sensitive allocations
    Anderson, EW
    [J]. JOURNAL OF ECONOMIC THEORY, 2005, 125 (02) : 93 - 150
  • [4] [Anonymous], 2012, Proc. Advances in Neural Information Processing Systems (NeurIPS)
  • [5] Linear and Dynamic Programs for Risk-Sensitive Cost Minimization
    Arapostathis, Ari
    Borkar, Vivek S.
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 3042 - 3047
  • [6] Risk-Sensitive Control and an Abstract Collatz-Wielandt Formula
    Arapostathis, Ari
    Borkar, Vivek S.
    Kumar, K. Suresh
    [J]. JOURNAL OF THEORETICAL PROBABILITY, 2016, 29 (04) : 1458 - 1484
  • [7] Arrow K J., 1971, Essays in the Theory of Risk-Bearing
  • [8] Asienkiewicz H., 2017, Applicationes Mathematicae (Warsaw), V44, P149
  • [9] Distributionally Robust Markov Decision Processes and Their Connection to Risk Measures
    Baeuerle, Nicole
    Glauner, Alexander
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2021, 47 (03) : 1757 - 1780
  • [10] Stochastic optimal growth model with risk sensitive preferences
    Baeuerle, Nicole
    Jaskiewicz, Anna
    [J]. JOURNAL OF ECONOMIC THEORY, 2018, 173 : 181 - 200