RISK-SENSITIVE DECISION MAKING VIA CONSTRAINED EXPECTED RETURNS

被引:0
|
作者
Hahn, Juergen [1 ]
Zoubir, Abdelhak M. [1 ]
机构
[1] Tech Univ Darmstadt, Signal Proc Grp, Merckstr 25, D-64283 Darmstadt, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
Markov decision process; Risk; Decision making; Constrained optimization; Reinforcement Learning; REINFORCEMENT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Decision making based on Markov decision processes (MDPs) is an emerging research area as MDPs provide a convenient formalism to learn an optimal behavior in terms of a given reward. In many applications there are critical states that might harm the agent or the environment and should therefore be avoided. In practice, those states are often simply penalized with a negative reward where the penalty is set in a trial-anderror approach. For this reason, we propose a modification of the well-known value iteration algorithm that guarantees that critical states are visited with a pre-set probability only. Since this leads to an infeasible problem, we investigate the effect of nonlinear and linear approximations and discuss the effects. Two examples demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:2569 / 2573
页数:5
相关论文
共 50 条
  • [21] How Does Variable Response Effort Influence Risk-Sensitive Decision Making in Pigeons?
    Baetz-Dougan, Madelaine
    Troje, Nikolaus F.
    CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2014, 68 (04): : 279 - 279
  • [22] RISK-SENSITIVE DECISION MAKING AND SELF-HARM AMONG YOUTH WITH BIPOLAR DISORDER
    Dimick, Mikaela K.
    Sultan, Alysha A.
    Kennedy, Kody G.
    Goldstein, Benjamin I.
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2022, 61 (10): : S216 - S216
  • [23] Risk-Sensitive Decision-Making in Patients with Posterior Parietal and Ventromedial Prefrontal Cortex Injury
    Studer, Bettina
    Manes, Facundo
    Humphreys, Glyn
    Robbins, Trevor W.
    Clark, Luke
    CEREBRAL CORTEX, 2015, 25 (01) : 1 - 9
  • [24] Risk-sensitive decision support system for tunnel construction
    Likhitruangsilp, V
    Ioannou, PG
    GEOTECHNICAL ENGINEERING FOR TRANSPORTATION PROJECTS, VOL 2, 2004, (126): : 1508 - 1515
  • [25] Markov decision processes with risk-sensitive criteria: an overview
    Nicole Bäuerle
    Anna Jaśkiewicz
    Mathematical Methods of Operations Research, 2024, 99 : 141 - 178
  • [26] Verification of Markov Decision Processes with Risk-Sensitive Measures
    Cubuktepe, Murat
    Topcu, Ufuk
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 2371 - 2377
  • [27] Markov decision processes with risk-sensitive criteria: an overview
    Baeuerle, Nicole
    Jaskiewicz, Anna
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2024, 99 (1-2) : 141 - 178
  • [28] Risk-Sensitive Markov Decision Process with Limited Budget
    de Melo Moreira, Daniel Augusto
    Delgado, Karina Valdivia
    de Barros, Leliane Nunes
    2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 109 - 114
  • [29] RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES
    Sladky, Karel
    KYBERNETIKA, 2018, 54 (06) : 1218 - 1230
  • [30] On Risk-Sensitive Piecewise Deterministic Markov Decision Processes
    Guo, Xin
    Zhang, Yi
    APPLIED MATHEMATICS AND OPTIMIZATION, 2020, 81 (03): : 685 - 710