Cortical delta activity reflects reward prediction error and related behavioral adjustments, but at different times

Cited by: 104
Authors
Cavanagh, James F. [1]
Affiliations
[1] Univ New Mexico, Dept Psychol, Albuquerque, NM 87131 USA
Keywords
Prediction error; Delta; Reward positivity; Reinforcement learning; Decision making; Hierarchy; FEEDBACK-RELATED NEGATIVITY; FRONTAL MIDLINE THETA; NEURONAL OSCILLATIONS; EVIDENCE ACCUMULATION; DECISION-MAKING; EEG DYNAMICS; MODEL; DOPAMINE; UNCERTAINTY; ADAPTATION
DOI
10.1016/j.neuroimage.2015.02.007
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Recent work has suggested that reward prediction errors elicit a positive voltage deflection in the scalp-recorded electroencephalogram (EEG), an event sometimes termed a reward positivity. However, a strong test of this proposed relationship remains to be defined. Other important questions remain unaddressed, such as the role of the reward positivity in predicting future behavioral adjustments that maximize reward. To answer these questions, a three-armed bandit task was used to investigate the role of positive prediction errors during trial-by-trial exploration and task-set based exploitation. The feedback-locked reward positivity was characterized by delta band activities, and these related EEG features scaled with the degree of a computationally derived positive prediction error. However, these phenomena were also dissociated: the computational model predicted exploitative action selection and related response time speeding, whereas the feedback-locked EEG features did not. Compellingly, delta band dynamics time-locked to the subsequent bandit (the P3) successfully predicted these behaviors. These bandit-locked findings included an enhanced parietal to motor cortex delta phase lag that correlated with the degree of response time speeding, suggesting a mechanistic role for delta band activities in motivating action selection. This dissociation in feedback-locked vs. bandit-locked EEG signals is interpreted as a differentiation in hierarchically distinct types of prediction error, yielding novel predictions about these dissociable delta band phenomena during reinforcement learning and decision making. (C) 2015 Elsevier Inc. All rights reserved.
Pages: 205-216
Page count: 12