Reversal Learning and Dopamine: A Bayesian Perspective

被引:94
作者
Costa, Vincent D. [1 ]
Tran, Valery L. [1 ]
Turchi, Janita [1 ]
Averbeck, Bruno B. [1 ]
机构
[1] NIMH, Neuropsychol Lab, NIH, Bethesda, MD 20892 USA
关键词
Bayesian; dopamine; haloperidol; L-DOPA; reinforcement learning; reversal learning; PREFRONTAL CORTEX; PARKINSONS-DISEASE; ORBITOFRONTAL CORTEX; NUCLEUS-ACCUMBENS; FRONTAL-CORTEX; IN-VIVO; LESIONS; DISCRIMINATION; REINFORCEMENT; SENSITIVITY;
D O I
10.1523/JNEUROSCI.1989-14.2015
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Reversal learning has been studied as the process of learning to inhibit previously rewarded actions. Deficits in reversal learning have been seen after manipulations of dopamine and lesions of the orbitofrontal cortex. However, reversal learning is often studied in animals that have limited experience with reversals. As such, the animals are learning that reversals occur during data collection. We have examined a task regime in which monkeys have extensive experience with reversals and stable behavioral performance on a probabilistic two-arm bandit reversal learning task. We developed a Bayesian analysis approach to examine the effects of manipulations of dopamine on reversal performance in this regime. We find that the analysis can clarify the strategy of the animal. Specifically, at reversal, the monkeys switch quickly from choosing one stimulus to choosing the other, as opposed to gradually transitioning, which might be expected if they were using a naive reinforcement learning (RL) update of value. Furthermore, we found that administration of haloperidol affects the way the animals integrate prior knowledge into their choice behavior. Animals had a stronger prior on where reversals would occur on haloperidol than on levodopa (L-DOPA) or placebo. This strong prior was appropriate, because the animals had extensive experience with reversals occurring in the middle of the block. Overall, we find that Bayesian dissection of the behavior clarifies the strategy of the animals and reveals an effect of haloperidol on integration of prior information with evidence in favor of a choice reversal.
引用
收藏
页码:2407 / 2416
页数:10
相关论文
共 31 条
[11]   Differential influence of levodopa on reward-based learning in Parkinson's disease [J].
Graef, Susanne ;
Biele, Guido ;
Krugel, Lea K. ;
Marzinzik, Frank ;
Wahl, Michael ;
Wotka, Johann ;
Klostermann, Fabian ;
Heekeren, Hauke R. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2010, 4
[12]   Dorsal Striatal D2-Like Receptor Availability Covaries with Sensitivity to Positive Reinforcement during Discrimination Learning [J].
Groman, Stephanie M. ;
Lee, Buyean ;
London, Edythe D. ;
Mandelkern, Mark A. ;
James, Alex S. ;
Feiler, Karen ;
Rivera, Ronald ;
Dahlbom, Magnus ;
Sossi, Vesna ;
Vandervoort, Eric ;
Jentsch, J. David .
JOURNAL OF NEUROSCIENCE, 2011, 31 (20) :7291-7299
[13]   Reward-related reversal learning after surgical excisions in orbito-frontal or dorsolateral prefrontal cortex in humans [J].
Hornak, J ;
O'Doherty, J ;
Bramham, J ;
Rolls, ET ;
Morris, RG ;
Bullock, PR ;
Polkey, CE .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2004, 16 (03) :463-478
[14]   Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency [J].
Izquierdo, A ;
Suda, RK ;
Murray, EA .
JOURNAL OF NEUROSCIENCE, 2004, 24 (34) :7540-7548
[15]   LIMBIC LESIONS AND PROBLEM OF STIMULUS-REINFORCEMENT ASSOCIATIONS [J].
JONES, B ;
MISHKIN, M .
EXPERIMENTAL NEUROLOGY, 1972, 36 (02) :362-+
[16]  
Kuroki T, 1999, J PHARMACOL EXP THER, V288, P774
[17]   Dopamine D2/D3 receptors play a specific role in the reversal of a learned visual discrimination in monkeys [J].
Lee, Buyean ;
Groman, Stephanie ;
London, Edythe D. ;
Jentsch, James David .
NEUROPSYCHOPHARMACOLOGY, 2007, 32 (10) :2125-2134
[18]   A CONTROLLED, POWERFUL MULTIPLE-COMPARISON STRATEGY FOR SEVERAL SITUATIONS [J].
LEVIN, JR ;
SERLIN, RC ;
SEAMAN, MA .
PSYCHOLOGICAL BULLETIN, 1994, 115 (01) :153-159
[19]   Improved short-term spatial memory but impaired following the dopamine D2 agonist bromocriptine reversal learning in human volunteers [J].
Mehta, MA ;
Swainson, R ;
Ogilvie, AD ;
Sahakian, BJ ;
Robbins, TW .
PSYCHOPHARMACOLOGY, 2001, 159 (01) :10-20
[20]   A liquid-delivery device that provides precise reward control for neurophysiological and behavioral experiments [J].
Mitz, AR .
JOURNAL OF NEUROSCIENCE METHODS, 2005, 148 (01) :19-25