Neural mechanisms of acquired phasic dopamine responses in learning

Cited by: 68
Authors
Hazy, Thomas E. [3]
Frank, Michael J. [1, 2]
O'Reilly, Randall C. [3]
Affiliations
[1] Brown Univ, Dept Psychol, Providence, RI 02912 USA
[2] Brown Univ, Dept Cognit & Linguist Sci, Providence, RI 02912 USA
[3] Univ Colorado, Dept Psychol & Neurosci, Boulder, CO 80309 USA
Keywords
Dopamine; Learning; Conditioning; Basal ganglia; Computational model; NUCLEUS-ACCUMBENS DOPAMINE; MEDIAL PREFRONTAL CORTEX; BASAL GANGLIA; WORKING-MEMORY; LATERAL HABENULA; COMPARTMENTAL ORGANIZATION; ORBITOFRONTAL CORTEX; NEURONAL-ACTIVITY; SUBSTANTIA-NIGRA; CENTRAL AMYGDALA;
DOI
10.1016/j.neubiorev.2009.11.019
Chinese Library Classification (CLC) number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology];
Discipline classification code
03 ; 0303 ; 030303 ; 04 ; 0402 ;
Abstract
What biological mechanisms underlie the reward-predictive firing properties of midbrain dopaminergic neurons, and how do they relate to the complex constellation of empirical findings understood as Pavlovian and instrumental conditioning? We previously presented PVLV, a biologically inspired Pavlovian learning algorithm accounting for DA activity in terms of two interrelated systems: a primary value (PV) system, which governs how DA cells respond to a US (reward); and a learned value (LV) system, which governs how DA cells respond to a CS. Here, we provide a more extensive review of the biological mechanisms supporting phasic DA firing and their relation to the spate of Pavlovian conditioning phenomena and their sensitivity to focal brain lesions. We further extend the model by incorporating a new NV (novelty value) component reflecting the ability of novel stimuli to trigger phasic DA firing, providing "novelty bonuses" which encourage exploratory working memory updating and in turn speed learning in trace conditioning and other working memory-dependent paradigms. The evolving PVLV model builds upon insights developed in many earlier computational models, especially reinforcement learning models based on the ideas of Sutton and Barto, biological models, and the psychological model developed by Savastano and Miller. The PVLV framework synthesizes these various approaches, overcoming important shortcomings of each by providing a coherent and specific mapping to much of the relevant empirical data at both the micro- and macro-levels, and examines their relevance for higher order cognitive functions. © 2009 Elsevier Ltd. All rights reserved.
Pages: 701-720
Page count: 20