NEURONAL REWARD AND DECISION SIGNALS: FROM THEORIES TO DATA

被引:695
作者
Schultz, Wolfram [1 ]
机构
[1] Univ Cambridge, Dept Physiol Dev & Neurosci, Cambridge CB2 3DY, England
基金
欧洲研究理事会; 芬兰科学院; 美国国家卫生研究院; 瑞士国家科学基金会; 英国惠康基金;
关键词
MIDBRAIN DOPAMINE NEURONS; VENTRAL TEGMENTAL AREA; TONICALLY ACTIVE NEURONS; NUCLEUS-ACCUMBENS DOPAMINE; ANTERIOR CINGULATE CORTEX; LONG-TERM POTENTIATION; FRONTAL EYE FIELD; POSTERIOR PARIETAL CORTEX; DORSOLATERAL PREFRONTAL CORTEX; TEMPORALLY DISCOUNTED VALUES;
D O I
10.1152/physrev.00023.2014
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
Rewards are crucial objects that induce learning, approach behavior, choices, and emotions. Whereas emotions are difficult to investigate in animals, the learning function is mediated by neuronal reward prediction error signals which implement basic constructs of reinforcement learning theory. These signals are found in dopamine neurons, which emit a global reward signal to striatum and frontal cortex, and in specific neurons in striatum, amygdala, and frontal cortex projecting to select neuronal populations. The approach and choice functions involve subjective value, which is objectively assessed by behavioral choices eliciting internal, subjective reward preferences. Utility is the formal mathematical characterization of subjective value and a prime decision variable in economic choice theory. It is coded as utility prediction error by phasic dopamine responses. Utility can incorporate various influences, including risk, delay, effort, and social interaction. Appropriate for formal decision mechanisms, rewards are coded as object value, action value, difference value, and chosen value by specific neurons. Although all reward, reinforcement, and decision variables are theoretical constructs, their neuronal signals constitute measurable physical implementations and as such confirm the validity of these concepts. The neuronal reward signals provide guidance for behavior while constraining the free will to act.
引用
收藏
页码:853 / 951
页数:99
相关论文
共 651 条
[81]   AVERSIVE STIMULUS DIFFERENTIALLY TRIGGERS SUBSECOND DOPAMINE RELEASE IN REWARD REGIONS [J].
Budygin, E. A. ;
Park, J. ;
Bass, C. E. ;
Grinevich, V. P. ;
Bonin, K. D. ;
Wightman, R. M. .
NEUROSCIENCE, 2012, 201 :331-337
[82]   ACUTE AND CHRONIC HALOPERIDOL TREATMENT - COMPARISON OF EFFECTS ON NIGRAL DOPAMINERGIC CELL ACTIVITY [J].
BUNNEY, BS ;
GRACE, AA .
LIFE SCIENCES, 1978, 23 (16) :1715-1727
[83]   Other-regarding preferences in a non-human primate: Common marmosets provision food altruistically [J].
Burkart, Judith M. ;
Fehr, Ernst ;
Efferson, Charles ;
van Schaik, Carel P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (50) :19762-19766
[84]   Mirror neurons encode the subjective value of an observed action [J].
Caggiano, Vittorio ;
Fogassi, Leonardo ;
Rizzolatti, Giacomo ;
Casile, Antonino ;
Giese, Martin A. ;
Thier, Peter .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (29) :11848-11853
[85]   Contributions of Orbitofrontal and Lateral Prefrontal Cortices to Economic Choice and the Good-to-Action Transformation [J].
Cai, Xinying ;
Padoa-Schioppa, Camillo .
NEURON, 2014, 81 (05) :1140-1151
[86]   Neuronal Encoding of Subjective Value in Dorsal and Ventral Anterior Cingulate Cortex [J].
Cai, Xinying ;
Padoa-Schioppa, Camillo .
JOURNAL OF NEUROSCIENCE, 2012, 32 (11) :3791-3808
[87]   Heterogeneous Coding of Temporally Discounted Values in the Dorsal and Ventral Striatum during Intertemporal Choice [J].
Cai, Xinying ;
Kim, Soyoun ;
Lee, Daeyeol .
NEURON, 2011, 69 (01) :170-182
[88]   Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity [J].
Calabresi, P ;
Gubellini, P ;
Centonze, D ;
Picconi, B ;
Bernardi, G ;
Chergui, K ;
Svenningsson, P ;
Fienberg, AA ;
Greengard, P .
JOURNAL OF NEUROSCIENCE, 2000, 20 (22) :8443-8451
[89]  
Camerer C.F., 2003, Behavioral Game Theory: Experiments in Strategic Interaction
[90]   Double Dissociation of Stimulus-Value and Action-Value Learning in Humans with Orbitofrontal or Anterior Cingulate Cortex Damage [J].
Camille, Nathalie ;
Tsuchida, Ami ;
Fellows, Lesley K. .
JOURNAL OF NEUROSCIENCE, 2011, 31 (42) :15048-15052