Multiple mini-robots navigation using a collaborative multiagent reinforcement learning framework

被引:9
作者
Chaysri, Piyabhum [1 ]
Blekas, Konstantinos [1 ]
Vlachos, Kostas [1 ]
机构
[1] Univ Ioannina, Dept Comp Sci & Engn, Ioannina 45110, Greece
关键词
Reinforcement learning; multi-agents; mini-robots; autonomous navigation; moving obstacles avoidance; MICROROBOTS; DESIGN;
D O I
10.1080/01691864.2020.1757507
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this work we investigate the use of a reinforcement learning (RL) framework for the autonomous navigation of a group of mini-robots in a multi-agent collaborative environment. Each mini-robot is driven by inertial forces provided by two vibration motors that are controlled by a simple and efficient low-level speed controller. The action of the RL agent is the direction of each mini-robot, and it is based on the position of each mini-robot, the distance between them and the sign of the distance gradient between each mini-robot and the nearest one. Each mini-robot is considered a moving obstacle that must be avoided by the others. We propose suitable state space and reward function that result in an efficient collaborative RL framework. The classical and the double Q-learning algorithms are employed, where the latter is considered to learn optimal policies of mini-robots that offers more stable and reliable learning process. A simulation environment is created, using the ROS framework, that include a group of four mini-robots. The dynamic model of each mini-robot and of the vibration motors is also included. Several application scenarios are simulated and the results are presented to demonstrate the performance of the proposed approach.
引用
收藏
页码:902 / 916
页数:15
相关论文
共 39 条
[1]   Compact Q-learning optimized for micro-robots with processing and memory constraints [J].
Asadpour, M ;
Siegwart, R .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2004, 48 (01) :49-61
[2]   Natural actor-critic algorithms [J].
Bhatnagar, Shalabh ;
Sutton, Richard S. ;
Ghavamzadeh, Mohammad ;
Lee, Mark .
AUTOMATICA, 2009, 45 (11) :2471-2482
[3]   Stick and Slip Actuators: design, control, performances and applications [J].
Breguet, JM ;
Clavel, R .
MHS '98, PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON MICROMECHATRONICS AND HUMAN SCIENCE, 1998, :89-95
[4]  
BRUFAU J, 2005, P IEEE INT C ROB AUT
[5]  
BUCHI R, 1995, SPIE PHOT E 95 P MIC
[6]  
CHAYSRI P, 2019, P IEEE INT C INF INT
[7]  
Degris T, 2012, P 29 INT COFERENCE I, P179
[8]  
Gebhardt GHW, 2018, IEEE INT CONF ROBOT, P7688
[9]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[10]  
Guestrin C., 2002, P INT C MACH LEARN, V2, P227