Variability in Dopamine Genes Dissociates Model-Based and Model-Free Reinforcement Learning

Cited by: 68
Authors
Doll, Bradley B. [1, 2]
Bath, Kevin G. [3]
Daw, Nathaniel D. [4, 5]
Frank, Michael J. [3, 6]
Affiliations
[1] NYU, Ctr Neural Sci, 4 Washington PL, New York, NY 10003 USA
[2] Columbia Univ, Dept Psychol, New York, NY 10027 USA
[3] Brown Univ, Dept Cognit Linguist & Psychol Sci, Providence, RI 02912 USA
[4] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08540 USA
[5] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[6] Brown Univ, Brown Inst Brain Sci, Providence, RI 02912 USA
Funding
National Science Foundation (USA);
Keywords
decision-making; dopamine; genetics; reinforcement learning; O-METHYLTRANSFERASE PHARMACOGENETICS; PREDICT INDIVIDUAL-DIFFERENCES; PREFRONTAL CORTEX; WORKING-MEMORY; REGULATED PHOSPHOPROTEIN; CATECHOLAMINE LEVELS; PHOSPHATASE CASCADE; STRIATAL DOPAMINE; MESSENGER-RNA; DARPP-32;
DOI
10.1523/JNEUROSCI.1901-15.2016
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006;
Abstract
Considerable evidence suggests that multiple learning systems can drive behavior. Choice can proceed reflexively from previous actions and their associated outcomes, as captured by "model-free" learning algorithms, or flexibly from prospective consideration of outcomes that might occur, as captured by "model-based" learning algorithms. However, differential contributions of dopamine to these systems are poorly understood. Dopamine is widely thought to support model-free learning by modulating plasticity in striatum. Model-based learning may also be affected by these striatal effects, or by other dopaminergic effects elsewhere, notably on prefrontal working memory function. Indeed, prominent demonstrations linking striatal dopamine to putatively model-free learning did not rule out model-based effects, whereas other studies have reported dopaminergic modulation of verifiably model-based learning, but without distinguishing a prefrontal versus striatal locus. To clarify the relationships between dopamine, neural systems, and learning strategies, we combine a genetic association approach in humans with two well-studied reinforcement learning tasks: one isolating model-based from model-free behavior and the other sensitive to key aspects of striatal plasticity. Prefrontal function was indexed by a polymorphism in the COMT gene, differences of which reflect dopamine levels in the prefrontal cortex. This polymorphism has been associated with differences in prefrontal activity and working memory. Striatal function was indexed by a gene coding for DARPP-32, which is densely expressed in the striatum where it is necessary for synaptic plasticity. We found evidence for our hypothesis that variations in prefrontal dopamine relate to model-based learning, whereas variations in striatal dopamine function relate to model-free learning.
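The distinction the abstract draws between the two learning strategies can be illustrated with a minimal sketch. This is not the authors' model or task code: the function names, the learning rate `alpha`, the mixing weight `w`, and the two-action transition matrix are all assumptions chosen for illustration. The model-free learner caches a value per action and updates it from experienced reward; the model-based learner computes action values prospectively from a transition model and the values of the states each action leads to.

```python
import numpy as np


def model_free_update(q, action, reward, alpha=0.1):
    """Model-free (hypothetical sketch): move the cached value of the
    chosen action toward the received reward by a prediction-error step."""
    q = np.asarray(q, dtype=float).copy()
    q[action] += alpha * (reward - q[action])
    return q


def model_based_values(transition, second_stage_values):
    """Model-based (hypothetical sketch): expected value of each
    first-stage action, computed from a transition model.

    transition[a, s] is the assumed probability that action a leads
    to second-stage state s.
    """
    transition = np.asarray(transition, dtype=float)
    second_stage_values = np.asarray(second_stage_values, dtype=float)
    return transition @ second_stage_values


def mixed_values(q_mf, q_mb, w):
    """Weighted blend of the two value estimates; w = 1 is purely
    model-based, w = 0 purely model-free (w is an assumed parameter)."""
    return w * np.asarray(q_mb, dtype=float) + (1 - w) * np.asarray(q_mf, dtype=float)


if __name__ == "__main__":
    q_mf = model_free_update(np.zeros(2), action=0, reward=1.0, alpha=0.5)
    q_mb = model_based_values([[0.7, 0.3], [0.3, 0.7]], [1.0, 0.0])
    print(q_mf, q_mb, mixed_values(q_mf, q_mb, w=0.5))
```

In a sketch like this, a single weight `w` is one common way to summarize how much choices reflect prospective planning versus cached values; whether and how the paper parameterizes this is not stated in the abstract.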
Pages: 1211-1222
Page count: 12