Learning touch preferences with a tactile robot using dopamine modulated STDP in a model of insular cortex

被引:24
作者
Chou, Ting-Shuo [1 ]
Bucci, Liam D. [2 ]
Krichmar, Jeffrey L. [1 ,2 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Cognit Sci, 2224 Social & Behav Sci Gateway, Irvine, CA 92697 USA
关键词
tactile robot; reinforcement learning; dopamine; STDP; insular cortex; somatosensory cortex; TIMING-DEPENDENT PLASTICITY; SPIKING NEURAL-NETWORKS; DISTAL REWARD PROBLEM; TRAVELING-WAVES; PROPAGATING WAVES; DELAYED REWARDS; VISUAL-CORTEX; NEURONS; BEHAVIOR; RESPONSES;
D O I
10.3389/fnbot.2015.00006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neurorobots enable researchers to study how behaviors are produced by neural mechanisms in an uncertain, noisy, real-world environment. To investigate how the somatosensory system processes noisy, real-world touch inputs, we introduce a neurorobot called CARL-SJR, which has a full-body tactile sensory area. The design of CARL-SJR is such that it encourages people to communicate with it through gentle touch. CARL-SJR provides feedback to users by displaying bright colors on its surface. In the present study, we show that CARL-SJR is capable of learning associations between conditioned stimuli (CS; a color pattern on its surface) and unconditioned stimuli (US; a preferred touch pattern) by applying a spiking neural network (SNN) with neurobiologically inspired plasticity. Specifically, we modeled the primary somatosensory cortex, prefrontal cortex, striatum, and the insular cortex, which is important for hedonic touch, to process noisy data generated directly from CARL-SJR's tactile sensory area. To facilitate learning, we applied dopamine-modulated Spike Timing Dependent Plasticity (STDP) to our simulated prefrontal cortex, striatum, and insular cortex. To cope with noisy, varying inputs, the SNN was tuned to produce traveling waves of activity that carried spatiotemporal information. Despite the noisy tactile sensors, spike trains, and variations in subject hand swipes, the learning was quite robust. Further, insular cortex activities in the incremental pathway of dopaminergic reward system allowed us to control CARL-SJR's preference for touch direction without heavily pre-processed inputs. The emerged behaviors we found in this model match animal's behaviors wherein they prefer touch in particular areas and directions. Thus, the results in this paper could serve as an explanation on the underlying neural mechanisms for developing tactile preferences and hedonic touch.
引用
收藏
页数:18
相关论文
共 77 条
[1]   The Sensory Neurons of Touch [J].
Abraira, Victoria E. ;
Ginty, David D. .
NEURON, 2013, 79 (04) :618-639
[2]  
[Anonymous], 1995, Models of Information Processing in the Basal Ganglia
[3]   Standing waves and traveling waves distinguish two circuits in visual cortex [J].
Benucci, Andrea ;
Frazor, Robert A. ;
Carandini, Matteo .
NEURON, 2007, 55 (01) :103-117
[4]   A closed-loop neurobotic system for fine touch sensing [J].
Bologna, L. L. ;
Pinoteau, J. ;
Passot, J-B ;
Garrido, J. A. ;
Vogel, J. ;
Ros Vidal, E. ;
Arleo, A. .
JOURNAL OF NEURAL ENGINEERING, 2013, 10 (04)
[5]   Encoding/decoding of first and second order tactile afferents in a neurorobotic application [J].
Bologna, Luca Leonardo ;
Pinoteau, Jeremie ;
Brasselet, Romain ;
Maggiali, Marco ;
Arleo, Angelo .
JOURNAL OF PHYSIOLOGY-PARIS, 2011, 105 (1-3) :25-35
[6]   Opponency Revisited: Competition and Cooperation Between Dopamine and Serotonin [J].
Boureau, Y-Lan ;
Dayan, Peter .
NEUROPSYCHOPHARMACOLOGY, 2011, 36 (01) :74-97
[7]   Role of expressive behaviour for robots that learn from people [J].
Breazeal, Cynthia .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2009, 364 (1535) :3527-3538
[8]  
Brown J, 1999, J NEUROSCI, V19, P10502
[9]  
Bucci L. D., 2014, ROB AUT ICRA 2014 IE
[10]   Memory consolidation during sleep: a neurophysiological perspective [J].
Buzsaki, G .
JOURNAL OF SLEEP RESEARCH, 1998, 7 :17-23