Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control

被引:1581
作者
Daw, ND
Niv, Y
Dayan, P
机构
[1] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
[2] Hebrew Univ Jerusalem, Interdisciplinary Ctr Neural Computat, IL-91904 Jerusalem, Israel
关键词
D O I
10.1038/nn1560
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
A broad range of neural and behavioral data suggests that the brain contains multiple systems for behavioral choice, including one associated with prefrontal cortex and another with dorsolateral striatum. However, such a surfeit of control raises an additional choice problem: how to arbitrate between the systems when they disagree. Here, we consider dual-action choice systems from a normative perspective, using the computational theory of reinforcement learning. We identify a key trade-off pitting computational simplicity against the flexible and statistically efficient use of experience. The trade-off is realized in a competition between the dorsolateral striatal and prefrontal systems. We suggest a Bayesian principle of arbitration between them according to uncertainty, so each controller is deployed when it should be most accurate. This provides a unifying account of a wealth of experimental evidence about the factors favoring dominance by either system.
引用
收藏
页码:1704 / 1711
页数:8
相关论文
共 50 条
[1]   VARIATIONS IN THE SENSITIVITY OF INSTRUMENTAL RESPONDING TO REINFORCER DEVALUATION [J].
ADAMS, CD .
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION B-COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1982, 34 (MAY) :77-98
[2]   PARALLEL ORGANIZATION OF FUNCTIONALLY SEGREGATED CIRCUITS LINKING BASAL GANGLIA AND CORTEX [J].
ALEXANDER, GE ;
DELONG, MR ;
STRICK, PL .
ANNUAL REVIEW OF NEUROSCIENCE, 1986, 9 :357-381
[3]   MOTIVATIONAL CONTROL OF HETEROGENEOUS INSTRUMENTAL CHAINS [J].
BALLEINE, BW ;
GARNER, C ;
GONZALEZ, F ;
DICKINSON, A .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 1995, 21 (03) :203-217
[4]   Goal-directed instrumental action: contingency and incentive learning and their cortical substrates [J].
Balleine, BW ;
Dickinson, A .
NEUROPHARMACOLOGY, 1998, 37 (4-5) :407-419
[5]  
Balleine BW, 2000, J NEUROSCI, V20, P8954
[6]   A Bayesian approach to relevance in game playing [J].
Baum, EB ;
Smith, WD .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :195-242
[7]  
Blundell P, 2003, J NEUROSCI, V23, P7702
[8]   Conflict monitoring and anterior cingulate cortex: an update [J].
Botvinick, Matthew M. ;
Cohen, Jonathan D. ;
Carter, Cameron S. .
TRENDS IN COGNITIVE SCIENCES, 2004, 8 (12) :539-546
[9]   A computational model of parallel navigation systems in rodents [J].
Chavarriaga, R ;
Strösslin, T ;
Sheynikhovich, D ;
Gerstner, W .
NEUROINFORMATICS, 2005, 3 (03) :223-241
[10]   INSTRUMENTAL RESPONDING REMAINS SENSITIVE TO REINFORCER DEVALUATION AFTER EXTENSIVE TRAINING [J].
COLWILL, RM ;
RESCORLA, RA .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 1985, 11 (04) :520-536