The effect of novelty on reinforcement learning

Cited by: 24
Authors
Houillon, A. [1, 2, 3]
Lorenz, R. C. [3, 4]
Boehmer, W. [2]
Rapp, M. A. [3]
Heinz, A. [3]
Gallinat, J. [3]
Obermayer, K. [1, 2]
Affiliations
[1] Bernstein Ctr Computat Neurosci, Berlin, Germany
[2] Tech Univ Berlin, Dept Software Engn & Theoret Comp Sci, Neural Informat Proc Grp, Berlin, Germany
[3] Charite, Univ Med Berlin Campus Charite Mitte, Clin Psychiat & Psychotherapy, D-13353 Berlin, Germany
[4] Humboldt Univ, Dept Psychol, D-10099 Berlin, Germany
Source
DECISION MAKING: NEURAL AND BEHAVIOURAL APPROACHES | 2013 / Vol. 202
Keywords
novelty; reward; novelty-seeking trait; reinforcement learning; TRIDIMENSIONAL PERSONALITY QUESTIONNAIRE; NORMATIVE DATA; REWARD; DOPAMINE; PREDICTION; RESPONSES; ANTICIPATION; TEMPERAMENT; UNCERTAINTY; RELIABILITY;
DOI
10.1016/B978-0-444-62604-2.00021-6
Chinese Library Classification
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology]
Discipline classification codes
03; 0303; 030303; 04; 0402
Abstract
Recent research suggests that novelty has an influence on reward-related learning. Here, we showed that novel stimuli presented from a pre-familiarized category can accelerate or decelerate learning of the most rewarding category, depending on the condition. The extent of this influence depended on the individual trait of novelty seeking. Different reinforcement learning models were developed to quantify subjects' choices. We introduced a bias parameter to model explorative behavior toward novel stimuli and to characterize individual variation in novelty response. The theoretical framework allowed us to test different assumptions about the motivational value of novelty. The best-fitting model combined all novelty components and had a significant positive correlation with both the experimentally measured novelty bias and the independent novelty-seeking trait. Altogether, we have shown not only that novelty by itself enhances behavioral responses underlying reward processing, but also that novelty has a direct influence on reward-dependent learning processes, consistent with computational predictions.
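The abstract does not specify the models, so the following is only a minimal sketch of one way a novelty bias parameter could enter a reinforcement learning choice rule. It assumes a delta-rule value update and a softmax choice rule in which a constant `bias` is added to the value of any option flagged as novel. The names `softmax_choice_probs`, `update_value`, `bias`, `beta`, and `alpha` are hypothetical and are not taken from the paper.

```python
import math

def softmax_choice_probs(values, novel, bias, beta):
    """Softmax over option values; `bias` is added to options flagged novel.

    Assumption: novelty acts as an additive bonus on the value that enters
    the choice rule. The paper may define its bias parameter differently.
    """
    logits = [beta * (v + (bias if n else 0.0)) for v, n in zip(values, novel)]
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def update_value(value, reward, alpha):
    """Delta-rule update: move the value estimate toward the received reward."""
    return value + alpha * (reward - value)

# Two options with equal learned value; only the second is novel.
values = [0.5, 0.5]
novel = [False, True]
p_no_bias = softmax_choice_probs(values, novel, bias=0.0, beta=3.0)
p_bias = softmax_choice_probs(values, novel, bias=0.3, beta=3.0)
```

Under these assumptions, a positive `bias` shifts choice probability toward the novel option even when the learned values are equal, which is one way individual differences in novelty response could be captured by a single fitted parameter.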
Pages: 415-439
Page count: 25