Neural correlates of risk prediction error during reinforcement learning in humans

被引:65
作者
d'Acremont, Mathieu [1 ]
Lu, Zhong-Lin [2 ]
Li, Xiangrui [2 ]
Van der Linden, Martial [1 ]
Bechara, Antoine [2 ]
机构
[1] Univ Geneva, Natl Ctr Competence Res NCCR Affect Sci, CH-1205 Geneva, Switzerland
[2] Univ So Calif, Dept Psychol, Dana & David Dornsife Cognit Neurosci Imaging Ctr, Brain & Creat Inst, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
HUMAN PREFRONTAL CORTEX; POSTERIOR CINGULATE CORTEX; DIRECT-CURRENT STIMULATION; DECISION-MAKING; ORBITOFRONTAL CORTEX; DOPAMINE NEURONS; SOMATIC MARKERS; TAKING BEHAVIOR; GAMBLING TASK; REWARD VALUE;
D O I
10.1016/j.neuroimage.2009.04.096
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Behavioral studies have shown for decades that humans are sensitive to risk when making decisions. More recently, brain activities have been shown to be correlated with risky choices. But an important gap needs to be filled: How does the human brain learn which decisions are risky? In cognitive neuroscience, reinforcement learning has never been used to estimate reward variance, a common measure of risk in economics and psychology. It is thus unknown which brain regions are involved in risk learning. To address this question, participants completed a decision-making task during fMRI They chose repetitively from four decks of cards and each selection was followed by a stochastic payoff. Expected reward and risk differed among the decks. Participants' aim was to maximize payoffs, Risk and reward prediction errors were calculated after each payoff based on a novel reinforcement learning model. For reward prediction error, the strongest correlation was found with the BOLD response in the striatum. For risk prediction error, the strongest correlation was found with the BOLD responses in the insula and inferior frontal gyrus. We conclude that risk and reward prediction errors are processed by distinct neural circuits during reinforcement learning. Additional analyses revealed that the BOLD response in the inferior frontal gyrus was more pronounced for risk aversive participants, suggesting that this region also serves to inhibit risky choices. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:1929 / 1939
页数:11
相关论文
共 64 条
[11]   COGNITIVE-PROCESSES IN ANXIETY [J].
BUTLER, G ;
MATHEWS, A .
ADVANCES IN BEHAVIOUR RESEARCH AND THERAPY, 1983, 5 (01) :51-62
[12]   The contributions of lesion laterality and lesion volume to decision-making impairment following frontal lobe damage [J].
Clark, L ;
Manes, F ;
Nagui, A ;
Sahakian, BJ ;
Robbins, TW .
NEUROPSYCHOLOGIA, 2003, 41 (11) :1474-1483
[13]   Temporal dynamics of brain activation during a working memory task [J].
Cohen, JD ;
Perlstein, WM ;
Braver, TS ;
Nystrom, LE ;
Noll, DC ;
Jonides, J ;
Smith, EE .
NATURE, 1997, 386 (6625) :604-608
[14]  
d'Acremont M, 2009, HANDBOOK OF REWARD AND DECISION MAKING, P459, DOI 10.1016/B978-0-12-374620-7.00022-4
[15]   Neurobiological studies of risk assessment: A comparison of expected utility and mean-variance approaches [J].
d'Acremont, Mathieu ;
Bossaerts, Peter .
COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE, 2008, 8 (04) :363-374
[16]   Phenomenal characteristics of autobiographical memories for social and non-social events in social phobia [J].
D'Argembeau, Arnaud ;
Van der Linden, Martial ;
d'Acremont, Mathieu ;
Mayers, Isabelle .
MEMORY, 2006, 14 (05) :637-647
[17]  
Damasio Antonio R., 1994, Descartes' Error: Emotion, Reason, and the Human Brain
[18]  
Damasio H, 2005, HUMAN BRAIN ANATOMY
[19]  
DIACONIS P, 1986, ANN STAT, V14, P1, DOI 10.1214/aos/1176349830
[20]   AUTOREGRESSIVE CONDITIONAL HETEROSCEDASTICITY WITH ESTIMATES OF THE VARIANCE OF UNITED-KINGDOM INFLATION [J].
ENGLE, RF .
ECONOMETRICA, 1982, 50 (04) :987-1007