Temporal filtering of reward signals in the dorsal anterior cingulate cortex during a mixed-strategy game

被引:223
作者
Seo, Hyojung [1 ]
Lee, Daeyeol [1 ]
机构
[1] Yale Univ, Sch Med, Dept Neurobiol, New Haven, CT 06510 USA
关键词
reinforcement learning; game theory; neuroeconomics; decision; dopamine; reward;
D O I
10.1523/JNEUROSCI.2369-07.2007
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The process of decision making in humans and other animals is adaptive and can be tuned through experience so as to optimize the outcomes of their choices in a dynamic environment. Previous studies have demonstrated that the anterior cingulate cortex plays an important role in updating the animal's behavioral strategies when the action outcome contingencies change. Moreover, neurons in the anterior cingulate cortex often encode the signals related to expected or actual reward. We investigated whether reward-related activity in the anterior cingulate cortex is affected by the animal's previous reward history. This was tested in rhesus monkeys trained to make binary choices in a computer-simulated competitive zero-sum game. The animal's choice behavior was relatively close to the optimal strategy but also revealed small systematic biases that are consistent with the use of a reinforcement learning algorithm. In addition, the activity of neurons in the dorsal anterior cingulate cortex that was related to the reward received by the animal in a given trial often was modulated by the rewards in the previous trials. Some of these neurons encoded the rate of rewards in previous trials, whereas others displayed activity modulations more closely related to the reward prediction errors. In contrast, signals related to the animal's choices were represented only weakly in this cortical area. These results suggest that neurons in the dorsal anterior cingulate cortex might be involved in the subjective evaluation of choice outcomes based on the animal's reward history.
引用
收藏
页码:8366 / 8377
页数:12
相关论文
共 56 条
[1]   Anterior cingulate error-related activity is modulated by predicted reward [J].
Amiez, C ;
Joseph, JP ;
Procyk, E .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2005, 21 (12) :3447-3452
[2]   Reward encoding in the monkey anterior cingulate cortex [J].
Amiez, C. ;
Joseph, J. P. ;
Procyk, E. .
CEREBRAL CORTEX, 2006, 16 (07) :1040-1055
[3]   An integrative theory of locus coeruleus-norepinephrine function: Adaptive gain and optimal performance [J].
Aston-Jones, G ;
Cohen, JD .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :403-450
[4]   Prefrontal cortex and decision making in a mixed-strategy game [J].
Barraclough, DJ ;
Conroy, ML ;
Lee, D .
NATURE NEUROSCIENCE, 2004, 7 (04) :404-410
[5]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[6]   Monkeys reject unequal pay [J].
Brosnan, SF ;
de Waal, FBM .
NATURE, 2003, 425 (6955) :297-299
[7]  
Burnham K.P., 2002, Model selection and multimodel inference: a practical information-theoretic approach, DOI 10.1007/978-1-4757-2917-7_3
[8]   Quantitative variation of incentive and performance in the white rat [J].
Crespi, LP .
AMERICAN JOURNAL OF PSYCHOLOGY, 1942, 55 :467-517
[9]   Cortical substrates for exploratory decisions in humans [J].
Daw, Nathaniel D. ;
O'Doherty, John P. ;
Dayan, Peter ;
Seymour, Ben ;
Dolan, Raymond J. .
NATURE, 2006, 441 (7095) :876-879
[10]   The computational neurobiology of learning and reward [J].
Daw, ND ;
Doya, K .
CURRENT OPINION IN NEUROBIOLOGY, 2006, 16 (02) :199-204