RISK-SENSITIVE DECISION MAKING VIA CONSTRAINED EXPECTED RETURNS

被引:0
|
作者
Hahn, Juergen [1 ]
Zoubir, Abdelhak M. [1 ]
机构
[1] Tech Univ Darmstadt, Signal Proc Grp, Merckstr 25, D-64283 Darmstadt, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
Markov decision process; Risk; Decision making; Constrained optimization; Reinforcement Learning; REINFORCEMENT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Decision making based on Markov decision processes (MDPs) is an emerging research area as MDPs provide a convenient formalism to learn an optimal behavior in terms of a given reward. In many applications there are critical states that might harm the agent or the environment and should therefore be avoided. In practice, those states are often simply penalized with a negative reward where the penalty is set in a trial-anderror approach. For this reason, we propose a modification of the well-known value iteration algorithm that guarantees that critical states are visited with a pre-set probability only. Since this leads to an infeasible problem, we investigate the effect of nonlinear and linear approximations and discuss the effects. Two examples demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:2569 / 2573
页数:5
相关论文
共 50 条
  • [1] For the Win: Risk-Sensitive Decision-Making in Teams
    Gonzales, Josh
    Mishra, Sandeep
    Camp, Ronald D., II
    JOURNAL OF BEHAVIORAL DECISION MAKING, 2017, 30 (02) : 462 - 472
  • [2] Approximate solutions to constrained risk-sensitive Markov decision processes
    Kumar, Uday M.
    Bhat, Sanjay P.
    Kavitha, Veeraruna
    Hemachandra, Nandyala
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 310 (01) : 249 - 267
  • [3] Psychophysiological arousal and inter- and intraindividual differences in risk-sensitive decision making
    Studer, Bettina
    Scheibehenne, Benjamin
    Clark, Luke
    PSYCHOPHYSIOLOGY, 2016, 53 (06) : 940 - 950
  • [4] Risk-Sensitive Assessment of Decision-Making Capacity: A Comprehensive Defense
    Kim, Scott Y. H.
    Berens, Noah C.
    HASTINGS CENTER REPORT, 2023, 53 (04) : 30 - 43
  • [5] Gambling for Gatorade: risk-sensitive decision making for fluid rewards in humans
    Hayden, Benjamin Y.
    Platt, Michael L.
    ANIMAL COGNITION, 2009, 12 (01) : 201 - 207
  • [6] Gambling for Gatorade: risk-sensitive decision making for fluid rewards in humans
    Benjamin Y. Hayden
    Michael L. Platt
    Animal Cognition, 2009, 12 : 201 - 207
  • [7] Risk-Sensitive Decision-Making in Patients with Posterior Parietal and Ventromedial Prefrontal Cortex Injury
    Studer, Bettina
    Manes, Facundo
    Humphreys, Glyn
    Robbins, Trevor W.
    Clark, Luke
    CEREBRAL CORTEX, 2015, 25 (01) : 1 - 9
  • [8] Markov decision processes with risk-sensitive criteria: an overview
    Nicole Bäuerle
    Anna Jaśkiewicz
    Mathematical Methods of Operations Research, 2024, 99 : 141 - 178
  • [9] Markov decision processes with risk-sensitive criteria: an overview
    Baeuerle, Nicole
    Jaskiewicz, Anna
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2024, 99 (1-2) : 141 - 178
  • [10] Symbolic Dynamic Programming for Risk-sensitive Markov Decision Process with limited budget
    Moreira, Daniel A. M.
    Delgado, Karina Valdivia
    de Barros, Leliane Nunes
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 2019 - 2021