A Hybrid Spiking Neural Network Reinforcement Learning Agent for Energy-Efficient Object Manipulation

被引:14
作者
Oikonomou, Katerina Maria [1 ]
Kansizoglou, Ioannis [1 ]
Gasteratos, Antonios [1 ]
机构
[1] Democritus Univ Thrace, Dept Prod & Management Engn, Vas Sophias 12, GR-67132 Xanthi, Greece
关键词
spiking neural networks; robotic manipulation; hybrid DDPG; actor-critic; MODEL;
D O I
10.3390/machines11020162
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the wide spread of robotics technologies in everyday activities, from industrial automation to domestic assisted living applications, cutting-edge techniques such as deep reinforcement learning are intensively investigated with the aim to advance the technological robotics front. The mandatory limitation of power consumption remains an open challenge in contemporary robotics, especially in real-case applications. Spiking neural networks (SNN) constitute an ideal compromise as a strong computational tool with low-power capacities. This paper introduces a spiking neural network actor for a baseline robotic manipulation task using a dual-finger gripper. To achieve that, we used a hybrid deep deterministic policy gradient (DDPG) algorithm designed with a spiking actor and a deep critic network to train the robotic agent. Thus, the agent learns to obtain the optimal policies for the three main tasks of the robotic manipulation approach: target-object reach, grasp, and transfer. The proposed method has one of the main advantages that an SNN possesses, namely, its neuromorphic hardware implementation capacity that results in energy-efficient implementations. The latter accomplishment is highly demonstrated in the evaluation results of the SNN actor since the deep critic network was exploited only during training. Aiming to further display the capabilities of the introduced approach, we compare our model with the well-established DDPG algorithm.
引用
收藏
页数:17
相关论文
共 50 条
[1]   Real-Time Monocular Human Depth Estimation and Segmentation on Embedded Systems [J].
An, Shan ;
Zhou, Fangru ;
Yang, Mei ;
Zhu, Haogang ;
Fu, Changhong ;
Tsintotas, Konstantinos A. .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :55-62
[2]   Mapping Spiking Neural Networks to Neuromorphic Hardware [J].
Balaji, Adarsha ;
Das, Anup ;
Wu, Yuefeng ;
Huynh, Khanh ;
Dell'Anna, Francesco G. ;
Indiveri, Giacomo ;
Krichmar, Jeffrey L. ;
Dutt, Nikil D. ;
Schaafsma, Siebren ;
Catthoor, Francky .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (01) :76-86
[3]   Human-like Reflexes for Robotic Manipulation using Leaky Integrate-and-Fire Neurons [J].
Bauer, Christian ;
Milighetti, Giulio ;
Yan, Wenjie ;
Mikut, Ralf .
IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, :2572-2577
[4]  
Bohte S. M., 2000, 8th European Symposium on Artificial Neural Networks. ESANN"2000. Proceedings, P419
[5]  
Bouganis A., 2010, P 2010 INT JOINT C N, P1, DOI [10.1109/IJCNN.2010.5596525, DOI 10.1109/IJCNN.2010.5596525]
[6]   A review of the integrate-and-fire neuron model: I. Homogeneous synaptic input [J].
Burkitt, A. N. .
BIOLOGICAL CYBERNETICS, 2006, 95 (01) :1-19
[7]   Spike timing-dependent plasticity: A Hebbian learning rule [J].
Caporale, Natalia ;
Dan, Yang .
ANNUAL REVIEW OF NEUROSCIENCE, 2008, 31 :25-46
[8]  
Cheng R, 2018, PR MACH LEARN RES, V87
[9]   Model Compression and Acceleration for Deep Neural Networks The principles, progress, and challenges [J].
Cheng, Yu ;
Wang, Duo ;
Zhou, Pan ;
Zhang, Tao .
IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) :126-136
[10]  
Dalal Murtaza, 2021, Advances in Neural Information Processing Systems, V34