Modulation of value-based decision making behavior by subregions of the rat prefrontal cortex

被引:50
作者
Verharen, Jeroen P. H. [1 ,2 ,3 ]
den Ouden, Hanneke E. M. [4 ]
Adan, Roger A. H. [1 ]
Vanderschuren, Louk J. M. J. [2 ]
机构
[1] Univ Med Ctr Utrecht, Brain Ctr Rudolf Magnus, Dept Translat Neurosci, NL-3584 CG Utrecht, Netherlands
[2] Univ Utrecht, Dept Anim Sci & Soc, Div Behav Neurosci, Fac Vet Med, NL-3584 CM Utrecht, Netherlands
[3] Univ Calif Berkeley, Dept Mol & Cell Biol, Helen Wills Neurosci Inst, 229 Stanley Hall, Berkeley, CA 94720 USA
[4] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 HR Nijmegen, Netherlands
关键词
Value; Prefrontal cortex; Decision-making; Reinforcement learning; Behavioral modeling; Rats; Reward; Punishment; ORBITOFRONTAL CORTEX; FUNCTIONAL-HETEROGENEITY; REWARD; PREDICTION; AMYGDALA; HABITS; BRAIN; MODEL; STAY; TASK;
D O I
10.1007/s00213-020-05454-7
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Rationale During value-based decision-making, organisms make choices on the basis of reward expectations, which have been formed during prior action-outcome learning. Although it is known that neuronal manipulations of different subregions of the rat prefrontal cortex (PFC) have qualitatively different effects on behavioral tasks involving value-based decision-making, it is unclear how these regions contribute to the underlying component processes. Objectives Assessing how different regions of the rodent PFC contribute to component processes of value-based decision-making behavior, including reward (or positive feedback) learning, punishment (or negative feedback) learning, response persistence, and exploration versus exploitation. Methods We performed behavioral modeling of data of rats in a probabilistic reversal learning task after pharmacological inactivation of five PFC subregions, to assess how inactivation of these different regions affected the structure of responding of animals in the task. Results Our results show reductions in reward and punishment learning after PFC subregion inactivation. The prelimbic, infralimbic, lateral orbital, and medial orbital PFC particularly contributed to punishment learning, and the prelimbic and lateral orbital PFC to reward learning. In addition, response persistence depended on the infralimbic and medial orbital PFC. As a result, pharmacological inactivation of the infralimbic and lateral orbitofrontal cortex reduced the number of reversals achieved, whereas inactivation of the prelimbic and medial orbitofrontal cortex decreased the number of rewards obtained. Finally, using simulated data, we explain discrepancies with a previous study and demonstrate complex, interacting relationships between conventional measures of probabilistic reversal learning performance, such as win-stay/lose-switch behavior, and component processes of value-based decision-making. Conclusions Together, our data suggest that distinct components of value-based learning and decision-making are generated in medial and orbital PFC regions, displaying functional specialization and overlap, with a prominent role of large parts of the PFC in negative feedback processing.
引用
收藏
页码:1267 / 1280
页数:14
相关论文
共 42 条
[31]   Primate orbitofrontal cortex and adaptive behaviour [J].
Roberts, AC .
TRENDS IN COGNITIVE SCIENCES, 2006, 10 (02) :83-90
[32]   The Orbitofrontal Oracle: Cortical Mechanisms for the Prediction and Evaluation of Specific Behavioral Outcomes [J].
Rudebeck, Peter H. ;
Murray, Elisabeth A. .
NEURON, 2014, 84 (06) :1143-1156
[33]   Valuation and decision-making in frontal cortex: one or many serial or parallel systems? [J].
Rushworth, Matthew F. S. ;
Kolling, Nils ;
Sallet, Jerome ;
Mars, Rogier B. .
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) :946-955
[34]   A neural substrate of prediction and reward [J].
Schultz, W ;
Dayan, P ;
Montague, PR .
SCIENCE, 1997, 275 (5306) :1593-1599
[35]   Prefrontal Cortical Contribution to Risk-Based Decision Making [J].
St Onge, Jennifer R. ;
Floresco, Stan B. .
CEREBRAL CORTEX, 2010, 20 (08) :1816-1828
[36]  
Sutton R.S., 1998, Reinforcement Learning: An Introduction, V135
[37]   Differential contributions of striatal dopamine D1 and D2 receptors to component processes of value-based decision making [J].
Verharen, Jeroen P. H. ;
Adan, Roger A. H. ;
Vanderschuren, Louk J. M. J. .
NEUROPSYCHOPHARMACOLOGY, 2019, 44 (13) :2195-2204
[38]   How Reward and Aversion Shape Motivation and Decision Making: A Computational Account [J].
Verharen, Jeroen P. H. ;
Adan, Roger A. H. ;
Vanderschuren, Louk J. M. J. .
NEUROSCIENTIST, 2020, 26 (01) :87-99
[39]   Reinforcement learning across the rat estrous cycle [J].
Verharen, Jeroen P. H. ;
Kentrop, Jiska ;
Vanderschuren, Louk J. M. J. ;
Adan, Roger A. H. .
PSYCHONEUROENDOCRINOLOGY, 2019, 100 :27-31
[40]   A neuronal mechanism underlying decision-making deficits during hyperdopaminergic states [J].
Verharen, Jeroen P. H. ;
de Jong, Johannes W. ;
Roelofs, Theresia J. M. ;
Huffels, Christiaan F. M. ;
van Zessen, Ruud ;
Luijendijk, Mieneke C. M. ;
Hamelink, Ralph ;
Willuhn, Ingo ;
den Ouden, Hanneke E. M. ;
van der Plasse, Geoffrey ;
Adan, Roger A. H. ;
Vanderschuren, Louk J. M. J. .
NATURE COMMUNICATIONS, 2018, 9