A Hybrid Spiking Neural Network Reinforcement Learning Agent for Energy-Efficient Object Manipulation

被引：14

作者：

Oikonomou, Katerina Maria ^{[1
]}

Kansizoglou, Ioannis ^{[1
]}

Gasteratos, Antonios ^{[1
]}

机构：

[1] Democritus Univ Thrace, Dept Prod & Management Engn, Vas Sophias 12, GR-67132 Xanthi, Greece

来源：

MACHINES | 2023年 / 11卷 / 02期

关键词：

spiking neural networks; robotic manipulation; hybrid DDPG; actor-critic; MODEL;

D O I：

10.3390/machines11020162

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Due to the wide spread of robotics technologies in everyday activities, from industrial automation to domestic assisted living applications, cutting-edge techniques such as deep reinforcement learning are intensively investigated with the aim to advance the technological robotics front. The mandatory limitation of power consumption remains an open challenge in contemporary robotics, especially in real-case applications. Spiking neural networks (SNN) constitute an ideal compromise as a strong computational tool with low-power capacities. This paper introduces a spiking neural network actor for a baseline robotic manipulation task using a dual-finger gripper. To achieve that, we used a hybrid deep deterministic policy gradient (DDPG) algorithm designed with a spiking actor and a deep critic network to train the robotic agent. Thus, the agent learns to obtain the optimal policies for the three main tasks of the robotic manipulation approach: target-object reach, grasp, and transfer. The proposed method has one of the main advantages that an SNN possesses, namely, its neuromorphic hardware implementation capacity that results in energy-efficient implementations. The latter accomplishment is highly demonstrated in the evaluation results of the SNN actor since the deep critic network was exploited only during training. Aiming to further display the capabilities of the introduced approach, we compare our model with the well-established DDPG algorithm.

引用

页数：17

共 50 条

[1] Real-Time Monocular Human Depth Estimation and Segmentation on Embedded Systems [J].