Learning touch preferences with a tactile robot using dopamine modulated STDP in a model of insular cortex

被引:24
作者
Chou, Ting-Shuo [1 ]
Bucci, Liam D. [2 ]
Krichmar, Jeffrey L. [1 ,2 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Cognit Sci, 2224 Social & Behav Sci Gateway, Irvine, CA 92697 USA
关键词
tactile robot; reinforcement learning; dopamine; STDP; insular cortex; somatosensory cortex; TIMING-DEPENDENT PLASTICITY; SPIKING NEURAL-NETWORKS; DISTAL REWARD PROBLEM; TRAVELING-WAVES; PROPAGATING WAVES; DELAYED REWARDS; VISUAL-CORTEX; NEURONS; BEHAVIOR; RESPONSES;
D O I
10.3389/fnbot.2015.00006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neurorobots enable researchers to study how behaviors are produced by neural mechanisms in an uncertain, noisy, real-world environment. To investigate how the somatosensory system processes noisy, real-world touch inputs, we introduce a neurorobot called CARL-SJR, which has a full-body tactile sensory area. The design of CARL-SJR is such that it encourages people to communicate with it through gentle touch. CARL-SJR provides feedback to users by displaying bright colors on its surface. In the present study, we show that CARL-SJR is capable of learning associations between conditioned stimuli (CS; a color pattern on its surface) and unconditioned stimuli (US; a preferred touch pattern) by applying a spiking neural network (SNN) with neurobiologically inspired plasticity. Specifically, we modeled the primary somatosensory cortex, prefrontal cortex, striatum, and the insular cortex, which is important for hedonic touch, to process noisy data generated directly from CARL-SJR's tactile sensory area. To facilitate learning, we applied dopamine-modulated Spike Timing Dependent Plasticity (STDP) to our simulated prefrontal cortex, striatum, and insular cortex. To cope with noisy, varying inputs, the SNN was tuned to produce traveling waves of activity that carried spatiotemporal information. Despite the noisy tactile sensors, spike trains, and variations in subject hand swipes, the learning was quite robust. Further, insular cortex activities in the incremental pathway of dopaminergic reward system allowed us to control CARL-SJR's preference for touch direction without heavily pre-processed inputs. The emerged behaviors we found in this model match animal's behaviors wherein they prefer touch in particular areas and directions. Thus, the results in this paper could serve as an explanation on the underlying neural mechanisms for developing tactile preferences and hedonic touch.
引用
收藏
页数:18
相关论文
共 77 条
[11]  
Cannata G., 2008, MULT FUS INT INT SYS
[12]   Spike timing-dependent plasticity: A Hebbian learning rule [J].
Caporale, Natalia ;
Dan, Yang .
ANNUAL REVIEW OF NEUROSCIENCE, 2008, 31 :25-46
[13]   An efficient automated parameter tuning framework for spiking neural networks [J].
Carlson, Kristofor D. ;
Nageswaran, Jayram Moorkanikara ;
Dutt, Nikil ;
Krichmar, Jeffrey L. .
FRONTIERS IN NEUROSCIENCE, 2014, 8
[14]   Versatile networks of simulated spiking neurons displaying winner-take-all behavior [J].
Chen, Yanqing ;
McKinstry, Jeffrey L. ;
Edelman, Gerald M. .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2013, 7
[15]   Dopamine-signaled reward predictions generated by competitive excitation and inhibition in a spiking neural network model [J].
Chorley, Paul ;
Seth, Anil K. .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2011, 5
[16]   How do you feel - now? The anterior insula and human awareness [J].
Craig, A. D. .
NATURE REVIEWS NEUROSCIENCE, 2009, 10 (01) :59-70
[17]   How do you feel? Interoception: the sense of the physiological condition of the body [J].
Craig, AD .
NATURE REVIEWS NEUROSCIENCE, 2002, 3 (08) :655-666
[18]   A THALAMIC NUCLEUS SPECIFIC FOR PAIN AND TEMPERATURE SENSATION [J].
CRAIG, AD ;
BUSHNELL, MC ;
ZHANG, ET ;
BLOMQVIST, A .
NATURE, 1994, 372 (6508) :770-773
[19]   Tactile Sensing-From Humans to Humanoids [J].
Dahiya, Ravinder S. ;
Metta, Giorgio ;
Valle, Maurizio ;
Sandini, Giulio .
IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (01) :1-20
[20]   Extending the effects of spike-timing-dependent plasticity to behavioral timescales [J].
Drew, Patrick J. ;
Abbott, L. F. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (23) :8876-8881