Optimal policy for value-based decision-making

Cited by: 120
Authors
Tajima, Satohiro [1]
Drugowitsch, Jan [1,2]
Pouget, Alexandre [1,3,4]
Affiliations
[1] Univ Geneva, Dept Neurosci Fondamentales, Rue Michel Servet 1, CH-1211 Geneva, Switzerland
[2] Harvard Med Sch, Dept Neurobiol, 220 Longwood Ave, Boston, MA 02115 USA
[3] Univ Rochester, Dept Brain & Cognit Sci, Rochester, NY 14627 USA
[4] UCL, Gatsby Computat Neurosci Unit, London, England
Source
NATURE COMMUNICATIONS | 2016 / Vol. 7
Funding
Swiss National Science Foundation;
Keywords
DRIFT-DIFFUSION MODEL; VISUAL FIXATIONS;
DOI
10.1038/ncomms12400
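Chinese Library Classification number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract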
For decades now, normative theories of perceptual decisions, and their implementation as drift diffusion models, have driven and significantly improved our understanding of human and animal behaviour and the underlying neural processes. While similar processes seem to govern value-based decisions, we still lack the theoretical understanding of why this ought to be the case. Here, we show that, similar to perceptual decisions, drift diffusion models implement the optimal strategy for value-based decisions. Such optimal decisions require the models' decision boundaries to collapse over time, and to depend on the a priori knowledge about reward contingencies. Diffusion models only implement the optimal strategy under specific task assumptions, and cease to be optimal once we start relaxing these assumptions, by, for example, using non-linear utility functions. Our findings thus provide the much-needed theory for value-based decisions, explain the apparent similarity to perceptual decisions, and predict conditions under which this similarity should break down.
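As a rough illustration of the model class named in the abstract, the sketch below simulates a single drift diffusion trial whose decision boundaries shrink over time. The abstract states only that optimal boundaries collapse; the linear collapse, the function name `simulate_ddm`, and every parameter value here are assumptions chosen for illustration, not the boundary shape or procedure derived in the paper.

```python
import random


def simulate_ddm(drift, noise_sd=1.0, dt=0.001, bound0=1.0,
                 collapse_rate=0.5, max_time=5.0, rng=None):
    """Simulate one drift-diffusion trial with a linearly collapsing bound.

    Returns (choice, decision_time); choice is +1 (upper bound) or -1 (lower).
    All parameter names and the linear collapse are illustrative assumptions.
    """
    rng = rng or random.Random()
    x, t = 0.0, 0.0
    while t < max_time:
        # Euler-Maruyama step: deterministic drift plus Gaussian diffusion noise.
        x += drift * dt + noise_sd * (dt ** 0.5) * rng.gauss(0.0, 1.0)
        t += dt
        # Bound shrinks linearly with time, floored at zero (assumed shape).
        bound = max(bound0 - collapse_rate * t, 0.0)
        if x >= bound:
            return 1, t
        if x <= -bound:
            return -1, t
    # Timed out: choose by the sign of the accumulated evidence.
    return (1 if x >= 0 else -1), t


if __name__ == "__main__":
    rng = random.Random(0)
    trials = [simulate_ddm(drift=3.0, rng=rng) for _ in range(200)]
    frac_upper = sum(1 for choice, _ in trials if choice == 1) / len(trials)
    mean_time = sum(t for _, t in trials) / len(trials)
    print(f"fraction upper: {frac_upper:.2f}, mean decision time: {mean_time:.3f}s")
```

In this sketch, `drift` could stand for the value difference between two options, so a larger difference pushes the accumulator toward the upper bound faster. With the assumed defaults the bound reaches zero at 2 s, which forces a choice by then regardless of the evidence.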
Pages: 12