Interference and Shaping in Sensorimotor Adaptations with Rewards

被引:13
作者
Darshan, Ran [1 ,2 ,3 ]
Leblois, Arthur [2 ]
Hansel, David [2 ,3 ,4 ]
机构
[1] Hebrew Univ Jerusalem, Edmond & Lily Safra Ctr Brain Sci, Jerusalem, Israel
[2] Univ Paris 05, CNRS, UMR8119, Lab Neurophys & Physiol, F-75270 Paris, France
[3] Hebrew Univ Jerusalem, Interdisciplinary Ctr Neural Computat, Jerusalem, Israel
[4] Hebrew Univ Jerusalem, Alexander Silberman Inst Life Sci, Jerusalem, Israel
关键词
LONG-TERM POTENTIATION; MOTOR; REINFORCEMENT; PLASTICITY; MODEL; DISCRIMINATION; VARIABILITY; BIRDSONG;
D O I
10.1371/journal.pcbi.1003377
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
When a perturbation is applied in a sensorimotor transformation task, subjects can adapt and maintain performance by either relying on sensory feedback, or, in the absence of such feedback, on information provided by rewards. For example, in a classical rotation task where movement endpoints must be rotated to reach a fixed target, human subjects can successfully adapt their reaching movements solely on the basis of binary rewards, although this proves much more difficult than with visual feedback. Here, we investigate such a reward-driven sensorimotor adaptation process in a minimal computational model of the task. The key assumption of the model is that synaptic plasticity is gated by the reward. We study how the learning dynamics depend on the target size, the movement variability, the rotation angle and the number of targets. We show that when the movement is perturbed for multiple targets, the adaptation process for the different targets can interfere destructively or constructively depending on the similarities between the sensory stimuli (the targets) and the overlap in their neuronal representations. Destructive interferences can result in a drastic slowdown of the adaptation. As a result of interference, the time to adapt varies non-linearly with the number of targets. Our analysis shows that these interferences are weaker if the reward varies smoothly with the subject's performance instead of being binary. We demonstrate how shaping the reward or shaping the task can accelerate the adaptation dramatically by reducing the destructive interferences. We argue that experimentally investigating the dynamics of reward-driven sensorimotor adaptation for more than one sensory stimulus can shed light on the underlying learning rules.
引用
收藏
页数:20
相关论文
共 51 条
[1]   A basal ganglia-forebrain circuit in the songbird biases motor output to avoid vocal errors [J].
Andalman, Aaron S. ;
Fee, Michale S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (30) :12518-12523
[2]  
[Anonymous], 2015, Table of Integrals, Series, and Products
[3]   Spatial generalization of learning in smooth pursuit eye movements: Implications for the coordinate frame and sites of learning [J].
Chou, IH ;
Lisberger, SG .
JOURNAL OF NEUROSCIENCE, 2002, 22 (11) :4728-4739
[4]   Long-term potentiation in an avian basal ganglia nucleus essential for vocal learning [J].
Ding, L ;
Perkel, DJ .
JOURNAL OF NEUROSCIENCE, 2004, 24 (02) :488-494
[5]  
Donchin O, 2003, J NEUROSCI, V23, P9032
[6]   Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances [J].
Fiete, Ila R. ;
Fee, Michale S. ;
Seung, H. Sebastian .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 98 (04) :2038-2057
[7]   Gradient learning in spiking neural networks by dynamic perturbation of conductances [J].
Fiete, Ila R. ;
Seung, H. Sebastian .
PHYSICAL REVIEW LETTERS, 2006, 97 (04)
[8]   Temporal sparseness of the premotor drive is important for rapid learning in a neural network model of birdsong [J].
Fiete, IR ;
Hahnloser, RHR ;
Fee, MS ;
Seung, HS .
JOURNAL OF NEUROPHYSIOLOGY, 2004, 92 (04) :2274-2282
[9]   Functional Requirements for Reward-Modulated Spike-Timing-Dependent Plasticity [J].
Fremaux, Nicolas ;
Sprekeler, Henning ;
Gerstner, Wulfram .
JOURNAL OF NEUROSCIENCE, 2010, 30 (40) :13326-13337
[10]   Anatomy of a songbird basal ganglia circuit essential for vocal learning and plasticity [J].
Gale, Samuel D. ;
Perkel, David J. .
JOURNAL OF CHEMICAL NEUROANATOMY, 2010, 39 (02) :124-131