Optimal response vigor and choice under non-stationary outcome values

被引:0
|
作者
Amir Dezfouli
Bernard W. Balleine
Richard Nock
机构
[1] UNSW,School of Psychology
[2] Data61,undefined
[3] The Australian National University,undefined
[4] The University of Sydney,undefined
来源
关键词
Choice; Response vigor; Reward learning; Optimal actions;
D O I
暂无
中图分类号
学科分类号
摘要
Within a rational framework, a decision-maker selects actions based on the reward-maximization principle, which stipulates that they acquire outcomes with the highest value at the lowest cost. Action selection can be divided into two dimensions: selecting an action from various alternatives, and choosing its vigor, i.e., how fast the selected action should be executed. Both of these dimensions depend on the values of outcomes, which are often affected as more outcomes are consumed together with their associated actions. Despite this, previous research has only addressed the computational substrate of optimal actions in the specific condition that the values of outcomes are constant. It is not known what actions are optimal when the values of outcomes are non-stationary. Here, based on an optimal control framework, we derive a computational model for optimal actions when outcome values are non-stationary. The results imply that, even when the values of outcomes are changing, the optimal response rate is constant rather than decreasing. This finding shows that, in contrast to previous theories, commonly observed changes in action rate cannot be attributed solely to changes in outcome value. We then prove that this observation can be explained based on uncertainty about temporal horizons; e.g., the session duration. We further show that, when multiple outcomes are available, the model explains probability matching as well as maximization strategies. The model therefore provides a quantitative analysis of optimal action and explicit predictions for future testing.
引用
收藏
页码:182 / 204
页数:22
相关论文
共 50 条
  • [1] Optimal response vigor and choice under non-stationary outcome values
    Dezfouli, Amir
    Balleine, Bernard W.
    Nock, Richard
    PSYCHONOMIC BULLETIN & REVIEW, 2019, 26 (01) : 182 - 204
  • [2] Meta-learning optimal parameter values in non-stationary environments
    Sikora, Riyaz T.
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 800 - 806
  • [3] Evaluation of the methods for response analysis under non-stationary excitation
    Jangid, RS
    Datta, TK
    SHOCK AND VIBRATION, 1999, 6 (5-6) : 285 - 297
  • [4] Optimal encoding of non-stationary sources
    Reif, JH
    Storer, JA
    INFORMATION SCIENCES, 2001, 135 (1-2) : 87 - 105
  • [5] Optimal Scheduling of Reservoir Flood Control under Non-Stationary Conditions
    Mo, Chongxun
    Jiang, Changhao
    Lei, Xingbi
    Cen, Weiyan
    Yan, Zhiwei
    Tang, Gang
    Li, Lingguang
    Sun, Guikai
    Xing, Zhenxiang
    SUSTAINABILITY, 2023, 15 (15)
  • [6] On the construction of the minimizing sequence of controls in the non-stationary problem of optimal response
    Topunov, M.V.
    Avtomatika i Telemekhanika, 2001, (08): : 56 - 60
  • [7] Non-stationary frequency response function
    Rafael Blázquez
    Juana Arias-Trujillo
    Bulletin of Earthquake Engineering, 2013, 11 : 1895 - 1908
  • [8] Non-stationary frequency response function
    Blazquez, Rafael
    Arias-Trujillo, Juana
    BULLETIN OF EARTHQUAKE ENGINEERING, 2013, 11 (06) : 1895 - 1908
  • [9] EXTREME VALUES OF NON-STATIONARY RANDOM SEQUENCES.
    Huesler, Juerg
    1600, (23):
  • [10] Combining p-Values in Non-Stationary Panels
    Wu, Shaowen
    Yin, Yong
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (06) : 1412 - 1431