Multi-Agent Reinforcement Learning for Side-by-Side Navigation of Autonomous Wheelchairs

Cited by: 0

Authors:

Fonseca, Tiago [1]

Leao, Goncalo [2]

Ferreira, Luis Lino [1,3]

Sousa, Armando [2]

Severino, Ricardo [1]

Reis, Luis Paulo

Affiliations:

[1] Polytech Porto, ISEP, INESC TEC, Porto, Portugal

[2] Univ Porto, FEUP Fac Engn, INESC TEC, Porto, Portugal

[3] Univ Porto, FEUP Fac Engn, LIACC, Artificial Intelligence & Comp Sci Lab, Porto, Portugal

Source:

2024 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC | 2024

Keywords:

Intelligent Robotics; Multi-Agent; Reinforcement Learning; Robot Operating System (ROS); MODEL;

DOI:

10.1109/ICARSC61747.2024.10535919

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper explores the use of robotics and decentralized Multi-Agent Reinforcement Learning (MARL) for side-by-side navigation of Intelligent Wheelchairs (IWs). Building on previous work that used traditional single-agent methods, it adopts the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to provide control input, enabling a pair of IWs to be deployed as decentralized computing agents in real-world environments without relying on communication with each other. The Flatland 2D simulator, in conjunction with the Robot Operating System (ROS), serves as a realistic environment for training and testing the navigation algorithm. The reward function is overhauled: it now provides an individual reward for each agent and uses revised reward incentives. Additionally, the logic for identifying side-by-side navigation is improved to encourage dynamic alignment control. The preliminary results outline a promising research direction, with the IWs learning to navigate in various realistic hallway test scenarios. The results also suggest that, while the MADDPG approach holds potential over single-agent techniques for decentralized IW robotics applications, further investigation is needed before real-world deployment.

Citation

Pages: 138-143

Page count: 6

References (16 in total):

[1] Almeida F., Navigation of Simulated Adjacent Wheelchairs using Deep Reinforcement Learning

[2] [Anonymous], Markov games as a framework for multi-agent reinforcement learning

[3] Navigation system for agricultural machines: Nonlinear Model Predictive path tracking [J]. Backman, J.; Oksanen, T.; Visala, A. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2012, 82: 32-43

[4] Simulation Framework to Train Intelligent Agents towards an Assisted Driving Power Wheelchair for People with Disability [J]. Falzone, Giovanni; Giuffrida, Gianluca; Panicacci, Silvia; Donati, Massimiliano; Fanucci, Luca. ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021: 189-196

[5] A Survey on Intelligent Wheelchair Prototypes and Simulators [J]. Faria, Brigida Monica; Reis, Luis Paulo; Lau, Nuno. NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2014, 275: 545-557

[6] RETRACTED: Path Planning of a Multifunctional Elderly Intelligent Wheelchair Based on the Sensor and Fuzzy Bayesian Network Algorithm (Retracted Article) [J]. Kong, Jian; Li, Peng. JOURNAL OF SENSORS, 2022, 2022

[7] Using Deep Reinforcement Learning for Navigation in Simulated Hallways [J]. Leao, Goncalo; Almeida, Filipe; Trigo, Emanuel; Ferreira, Henrique; Sousa, Armando; Reis, Luis Paulo. 2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023: 207-213

[8] Lowe R, 2017, ADV NEUR IN, V30

[9] Maekawa Y., Modeling Eye-Gaze Behavior of Electric Wheelchair Drivers via Inverse Reinforcement Learning

[10] Mnih V, 2013, Arxiv, DOI arXiv:1312.5602
