NEURONAL REWARD AND DECISION SIGNALS: FROM THEORIES TO DATA

被引:695
作者
Schultz, Wolfram [1 ]
机构
[1] Univ Cambridge, Dept Physiol Dev & Neurosci, Cambridge CB2 3DY, England
基金
欧洲研究理事会; 芬兰科学院; 美国国家卫生研究院; 瑞士国家科学基金会; 英国惠康基金;
关键词
MIDBRAIN DOPAMINE NEURONS; VENTRAL TEGMENTAL AREA; TONICALLY ACTIVE NEURONS; NUCLEUS-ACCUMBENS DOPAMINE; ANTERIOR CINGULATE CORTEX; LONG-TERM POTENTIATION; FRONTAL EYE FIELD; POSTERIOR PARIETAL CORTEX; DORSOLATERAL PREFRONTAL CORTEX; TEMPORALLY DISCOUNTED VALUES;
D O I
10.1152/physrev.00023.2014
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
Rewards are crucial objects that induce learning, approach behavior, choices, and emotions. Whereas emotions are difficult to investigate in animals, the learning function is mediated by neuronal reward prediction error signals which implement basic constructs of reinforcement learning theory. These signals are found in dopamine neurons, which emit a global reward signal to striatum and frontal cortex, and in specific neurons in striatum, amygdala, and frontal cortex projecting to select neuronal populations. The approach and choice functions involve subjective value, which is objectively assessed by behavioral choices eliciting internal, subjective reward preferences. Utility is the formal mathematical characterization of subjective value and a prime decision variable in economic choice theory. It is coded as utility prediction error by phasic dopamine responses. Utility can incorporate various influences, including risk, delay, effort, and social interaction. Appropriate for formal decision mechanisms, rewards are coded as object value, action value, difference value, and chosen value by specific neurons. Although all reward, reinforcement, and decision variables are theoretical constructs, their neuronal signals constitute measurable physical implementations and as such confirm the validity of these concepts. The neuronal reward signals provide guidance for behavior while constraining the free will to act.
引用
收藏
页码:853 / 951
页数:99
相关论文
共 651 条
[1]   Distributed Coding of Actual and Hypothetical Outcomes in the Orbital and Dorsolateral Prefrontal Cortex [J].
Abe, Hiroshi ;
Lee, Daeyeol .
NEURON, 2011, 70 (04) :731-741
[2]   Optogenetic Interrogation of Dopaminergic Modulation of the Multiple Phases of Reward-Seeking Behavior [J].
Adamantidis, Antoine R. ;
Tsai, Hsing-Chen ;
Boutrel, Benjamin ;
Zhang, Feng ;
Stuber, Garret D. ;
Budygin, Evgeny A. ;
Tourino, Clara ;
Bonci, Antonello ;
Deisseroth, Karl ;
de Lecea, Luis .
JOURNAL OF NEUROSCIENCE, 2011, 31 (30) :10829-10835
[3]   The impulses produced by sensory nerve endings. Part 3. Impulses set up by touch and pressure. [J].
Adrian, ED ;
Zotterman, Y .
JOURNAL OF PHYSIOLOGY-LONDON, 1926, 61 (04) :465-483
[4]   SPECIOUS REWARD - BEHAVIORAL THEORY OF IMPULSIVENESS AND IMPULSE CONTROL [J].
AINSLIE, G .
PSYCHOLOGICAL BULLETIN, 1975, 82 (04) :463-496
[5]   IMPULSE CONTROL IN PIGEONS [J].
AINSLIE, GW .
JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 1974, 21 (03) :485-489
[6]   PARALLEL ORGANIZATION OF FUNCTIONALLY SEGREGATED CIRCUITS LINKING BASAL GANGLIA AND CORTEX [J].
ALEXANDER, GE ;
DELONG, MR ;
STRICK, PL .
ANNUAL REVIEW OF NEUROSCIENCE, 1986, 9 :357-381
[7]   NEURAL REPRESENTATIONS OF THE TARGET (GOAL) OF VISUALLY GUIDED ARM MOVEMENTS IN 3 MOTOR AREAS OF THE MONKEY [J].
ALEXANDER, GE ;
CRUTCHER, MD .
JOURNAL OF NEUROPHYSIOLOGY, 1990, 64 (01) :164-178
[8]   Reward-predicting and reward-detecting neuronal activity in the primate supplementary eye field [J].
Amador, N ;
Schlag-Rey, M ;
Schlag, J .
JOURNAL OF NEUROPHYSIOLOGY, 2000, 84 (04) :2166-2170
[9]   Basolateral amygdala neurons facilitate reward-seeking behavior by exciting nucleus accumbens neurons [J].
Ambroggi, Frederic ;
Ishikawa, Alkinori ;
Fields, Howard L. ;
Nicola, Saleem M. .
NEURON, 2008, 59 (04) :648-661
[10]   Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making [J].
Amemori, Ken-ichi ;
Graybiel, Ann M. .
NATURE NEUROSCIENCE, 2012, 15 (05) :776-785