Multi-Agent Reinforcement Learning for Side-by-Side Navigation of Autonomous Wheelchairs

被引:0
作者
Fonseca, Tiago [1 ]
Leao, Goncalo [2 ]
Ferreira, Luis Lino [1 ,3 ]
Sousa, Armando [2 ]
Severino, Ricardo [1 ]
Reis, Luis Paulo
机构
[1] Polytech Porto, ISEP, INESC TEC, Porto, Portugal
[2] Univ Porto, FEUP Fac Engn, INESC TEC, Porto, Portugal
[3] Univ Porto, FEUP Fac Engn, LIACC, Artificial Intelligence & Comp Sci Lab, Porto, Portugal
来源
2024 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC | 2024年
关键词
Intelligent Robotics; Multi-Agent; Reinforcement Learning; Robot Operating System (ROS); MODEL;
D O I
10.1109/ICARSC61747.2024.10535919
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores the use of Robotics and decentralized Multi-Agent Reinforcement Learning (MARL) for side-by-side navigation in Intelligent Wheelchairs (IW). Evolving from a previous work approach using traditional single-agent methodologies, it adopts a Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to provide control input and enable a pair of IW to be deployed as decentralized computing agents in real-world environments, discarding the need to rely on communication between each other. In this study, the Flatland 2D simulator, in conjunction with the Robot Operating System (ROS), is used as a realistic environment to train and test the navigation algorithm. An overhaul of the reward function is introduced, which now provides individual rewards for each agent and revised reward incentives. Additionally, the logic for identifying side-by-side navigation was improved, to encourage dynamic alignment control. The preliminary results outline a promising research direction, with the IWs learning to navigate in various realistic hallways testing scenarios. The outcome also suggests that while the MADDPG approach holds potential over single-agent techniques for the decentralized IW robotics application, further investigation are needed for real-world deployment.
引用
收藏
页码:138 / 143
页数:6
相关论文
共 16 条
[1]  
Almeida F., Navigation of Simulated Adjacent Wheelchairs using Deep Reinforcement Learning"
[2]  
[Anonymous], Markov games as a framework for multi-agent reinforcement learning"
[3]   Navigation system for agricultural machines: Nonlinear Model Predictive path tracking [J].
Backman, J. ;
Oksanen, T. ;
Visala, A. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2012, 82 :32-43
[4]   Simulation Framework to Train Intelligent Agents towards an Assisted Driving Power Wheelchair for People with Disability [J].
Falzone, Giovanni ;
Giuffrida, Gianluca ;
Panicacci, Silvia ;
Donati, Massimiliano ;
Fanucci, Luca .
ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021, :189-196
[5]   A Survey on Intelligent Wheelchair Prototypes and Simulators [J].
Faria, Brigida Monica ;
Reis, Luis Paulo ;
Lau, Nuno .
NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2014, 275 :545-557
[6]   RETRACTED: Path Planning of a Multifunctional Elderly Intelligent Wheelchair Based on the Sensor and Fuzzy Bayesian Network Algorithm (Retracted Article) [J].
Kong, Jian ;
Li, Peng .
JOURNAL OF SENSORS, 2022, 2022
[7]   Using Deep Reinforcement Learning for Navigation in Simulated Hallways [J].
Leao, Goncalo ;
Almeida, Filipe ;
Trigo, Emanuel ;
Ferreira, Henrique ;
Sousa, Armando ;
Reis, Luis Paulo .
2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, :207-213
[8]  
Lowe R, 2017, ADV NEUR IN, V30
[9]  
Maekawa Y., Modeling Eye -Gaze Behavior of Electric Wheelchair Drivers via Inverse Reinforcement Learning
[10]  
Mnih V, 2013, Arxiv, DOI arXiv:1312.5602