Reinforcement-Learning-Based Route Generation for Heavy-Traffic Autonomous Mobile Robot Systems

被引：9

作者：

Kozjek, Dominik ^{[1
]}

Malus, Andreja ^{[1
]}

Vrabic, Rok ^{[1
]}

机构：

[1] Univ Ljubljana, Fac Mech Engn, SI-1000 Ljubljana, Slovenia

来源：

SENSORS | 2021年 / 21卷 / 14期

关键词：

intralogistics; autonomous mobile robots; multi-robot cooperation; reinforcement learning; route planning;

D O I：

10.3390/s21144809

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Autonomous mobile robots (AMRs) are increasingly used in modern intralogistics systems as complexity and performance requirements become more stringent. One way to increase performance is to improve the operation and cooperation of multiple robots in their shared environment. The paper addresses these problems with a method for off-line route planning and on-line route execution. In the proposed approach, pre-computation of routes for frequent pick-up and drop-off locations limits the movements of AMRs to avoid conflict situations between them. The paper proposes a reinforcement learning approach where an agent builds the routes on a given layout while being rewarded according to different criteria based on the desired characteristics of the system. The results show that the proposed approach performs better in terms of throughput and reliability than the commonly used shortest-path-based approach for a large number of AMRs operating in the system. The use of the proposed approach is recommended when the need for high throughput requires the operation of a relatively large number of AMRs in relation to the size of the space in which the robots operate.

引用

页数：19

共 19 条

[11] Kleiner A, 2011, IEEE INT C INT ROBOT, P3276, DOI 10.1109/IROS.2011.6048339
[12] MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments
Liu, Zuxin
Chen, Baiming
Zhou, Hongyi
Koushik, Guru
Hebert, Martial
Zhao, Ding
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 11748 - 11754
[13] Olmi Roberto, 2011, International Journal of Vehicle Autonomous Systems, V9, P5, DOI 10.1504/IJVAS.2011.038177
[14] Pecora F, 2012, IEEE INT C INT ROBOT, P5262, DOI 10.1109/IROS.2012.6385862
[15] PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning
Sartoretti, Guillaume
Kerr, Justin
Shi, YunFei
Wagner, Glenn
Kumar, T. K. Satish
Koenig, Sven
Choset, Howie
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2378 - 2385
[16] Schulman J., 2017, 170706347 ARXIV
[17] Stable Baselines, GITHUB REP GITHUB REP
[18] Uttendorf S, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), P977, DOI 10.1109/IEEM.2016.7798023
[19] Xiao X., 2020, 201113112 ARXIV 201113112 ARXIV

← 1 2 →