Evidence for a neural law of effect

被引:37
作者
Athalye, Vivek R. [1 ,2 ]
Santos, Fernando J. [1 ]
Carmena, Jose M. [2 ,3 ,4 ,5 ]
Costa, Rui M. [1 ,6 ,7 ]
机构
[1] Champalimaud Ctr Unknown, Champalimaud Neurosci Programme, P-1400038 Lisbon, Portugal
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[4] Univ Calif Berkeley, Joint Grad Grp Bioengn, Berkeley, CA 94720 USA
[5] Univ Calif San Francisco, Berkeley, CA 94720 USA
[6] Columbia Univ, Zuckerman Mind Brain Behav Inst, Dept Neurosci, New York, NY 10032 USA
[7] Columbia Univ, Zuckerman Mind Brain Behav Inst, Dept Neurol, New York, NY 10032 USA
基金
欧洲研究理事会;
关键词
DENDRITIC SPINES; CORTICOSTRIATAL PLASTICITY; MOTOR CORTEX; CLOSED-LOOP; REINFORCEMENT; NEURONS; REWARD; BRAIN; STIMULATION; RECOMBINASE;
D O I
10.1126/science.aao6058
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Thorndike's law of effect states that actions that lead to reinforcements tend to be repeated more often. Accordingly, neural activity patterns leading to reinforcement are also reentered more frequently. Reinforcement relies on dopaminergic activity in the ventral tegmental area (VTA), and animals shape their behavior to receive dopaminergic stimulation. Seeking evidence for a neural law of effect, we found that mice learn to reenter more frequently motor cortical activity patterns that trigger optogenetic VTA self-stimulation. Learning was accompanied by gradual shaping of these patterns, with participating neurons progressively increasing and aligning their covariance to that of the target pattern. Motor cortex patterns that lead to phasic dopaminergic VTA activity are progressively reinforced and shaped, suggesting a mechanism by which animals select and shape actions to reliably achieve reinforcement.
引用
收藏
页码:1024 / 1029
页数:6
相关论文
共 45 条
[1]  
[Anonymous], 1984, An Introduction to Latent Variable Models
[2]  
[Anonymous], 1998, American Psychologist, DOI [DOI 10.1037/0003-066X.53.10.1125, DOI 10.1037/H0092987]
[3]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[4]   Emergence of Coordinated Neural Dynamics Underlies Neuroprosthetic Learning and Skillful Control [J].
Athalye, Vivek R. ;
Ganguly, Karunesh ;
Costa, Rui M. ;
Carmena, Jose M. .
NEURON, 2017, 93 (04) :955-+
[5]   Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories [J].
Barnes, TD ;
Kubota, Y ;
Hu, D ;
Jin, DZZ ;
Graybiel, AM .
NATURE, 2005, 437 (7062) :1158-1161
[6]   Motor Learning Consolidates Arc-Expressing Neuronal Ensembles in Secondary Motor Cortex [J].
Cao, Vania Yu ;
Ye, Yizhou ;
Mastwal, Surjeet ;
Ren, Ming ;
Coon, Matthew ;
Liu, Qing ;
Costa, Rui M. ;
Wang, Kuan Hong .
NEURON, 2015, 86 (06) :1385-1392
[7]   Learning to control a brain-machine interface for reaching and grasping by primates [J].
Carmena, JM ;
Lebedev, MA ;
Crist, RE ;
O'Doherty, JE ;
Santucci, DM ;
Dimitrov, DF ;
Patil, PG ;
Henriquez, CS ;
Nicolelis, MAL .
PLOS BIOLOGY, 2003, 1 (02) :193-208
[8]   Volitional modulation of optically recorded calcium signals during neuroprosthetic learning [J].
Clancy, Kelly B. ;
Koralek, Aaron C. ;
Costa, Rui M. ;
Feldman, Daniel E. ;
Carmena, Jose M. .
NATURE NEUROSCIENCE, 2014, 17 (06) :807-809
[9]   Variability in motor learning: relocating, channeling and reducing noise [J].
Cohen, R. G. ;
Sternad, D. .
EXPERIMENTAL BRAIN RESEARCH, 2009, 193 (01) :69-83
[10]   INTRA-CRANIAL SELF-STIMULATION IN RELATION TO THE ASCENDING DOPAMINERGIC SYSTEMS OF THE MIDBRAIN - MOVEABLE ELECTRODE MAPPING STUDY [J].
CORBETT, D ;
WISE, RA .
BRAIN RESEARCH, 1980, 185 (01) :1-15