Optimal response vigor and choice under non-stationary outcome values

被引：0

作者：

Amir Dezfouli

Bernard W. Balleine

Richard Nock

机构：

[1] UNSW,School of Psychology

[2] Data61,undefined

[3] The Australian National University,undefined

[4] The University of Sydney,undefined

来源：

Psychonomic Bulletin & Review | 2019年 / 26卷

关键词：

Choice; Response vigor; Reward learning; Optimal actions;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Within a rational framework, a decision-maker selects actions based on the reward-maximization principle, which stipulates that they acquire outcomes with the highest value at the lowest cost. Action selection can be divided into two dimensions: selecting an action from various alternatives, and choosing its vigor, i.e., how fast the selected action should be executed. Both of these dimensions depend on the values of outcomes, which are often affected as more outcomes are consumed together with their associated actions. Despite this, previous research has only addressed the computational substrate of optimal actions in the specific condition that the values of outcomes are constant. It is not known what actions are optimal when the values of outcomes are non-stationary. Here, based on an optimal control framework, we derive a computational model for optimal actions when outcome values are non-stationary. The results imply that, even when the values of outcomes are changing, the optimal response rate is constant rather than decreasing. This finding shows that, in contrast to previous theories, commonly observed changes in action rate cannot be attributed solely to changes in outcome value. We then prove that this observation can be explained based on uncertainty about temporal horizons; e.g., the session duration. We further show that, when multiple outcomes are available, the model explains probability matching as well as maximization strategies. The model therefore provides a quantitative analysis of optimal action and explicit predictions for future testing.

引用

页码：182 / 204

页数：22

共 50 条

[1] Optimal response vigor and choice under non-stationary outcome values
Dezfouli, Amir
Balleine, Bernard W.
Nock, Richard
PSYCHONOMIC BULLETIN & REVIEW, 2019, 26 (01) : 182 - 204
[2] Meta-learning optimal parameter values in non-stationary environments
Sikora, Riyaz T.
KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 800 - 806
[3] Evaluation of the methods for response analysis under non-stationary excitation
Jangid, RS
Datta, TK
SHOCK AND VIBRATION, 1999, 6 (5-6) : 285 - 297
[4] Optimal encoding of non-stationary sources
Reif, JH
Storer, JA
INFORMATION SCIENCES, 2001, 135 (1-2) : 87 - 105
[5] Optimal Scheduling of Reservoir Flood Control under Non-Stationary Conditions
Mo, Chongxun
Jiang, Changhao
Lei, Xingbi
Cen, Weiyan
Yan, Zhiwei
Tang, Gang
Li, Lingguang
Sun, Guikai
Xing, Zhenxiang
SUSTAINABILITY, 2023, 15 (15)
[6] On the construction of the minimizing sequence of controls in the non-stationary problem of optimal response
Topunov, M.V.
Avtomatika i Telemekhanika, 2001, (08): : 56 - 60
[7] Non-stationary frequency response function
Rafael Blázquez
Juana Arias-Trujillo
Bulletin of Earthquake Engineering, 2013, 11 : 1895 - 1908
[8] Non-stationary frequency response function
Blazquez, Rafael
Arias-Trujillo, Juana
BULLETIN OF EARTHQUAKE ENGINEERING, 2013, 11 (06) : 1895 - 1908
[9] EXTREME VALUES OF NON-STATIONARY RANDOM SEQUENCES.
Huesler, Juerg
1600, (23):
[10] Combining p-Values in Non-Stationary Panels
Wu, Shaowen
Yin, Yong
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (06) : 1412 - 1431

← 1 2 3 4 5 →