Multi-Agent Formation Control With Obstacle Avoidance Using Proximal Policy Optimization

被引:7
作者
Sadhukhan, Priyam [1 ]
Selmic, Rastko R. [1 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
来源
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2021年
关键词
D O I
10.1109/SMC52423.2021.9658635
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a formation of second-order holonomic agents is made to navigate through an obstacle field using proximal policy optimization (PPO) based deep reinforcement learning (DRL). The angle-based formation is allowed to shrink while maintaining its shape in order to navigate through tight spaces and take the geometric centroid of the formation towards the goal. Two reward schemes are presented, one based on the actions of individual agents and another based on the actions of the team as a whole. For each case, all the agents share a single policy that is trained in a centralized manner. Distance measurements, state information, error information regarding neighboring agents, and simulation information are used for training each policy in an end-to-end fashion. Simulation results for both approaches are compared.
引用
收藏
页码:2694 / 2699
页数:6
相关论文
共 50 条
[31]   Experimental verification of coverage control of multi-agent systems with obstacle avoidance [J].
Siri, Fidelia Chaitra ;
Song, Jie ;
Svinin, Mikhail .
JOURNAL OF ENGINEERING-JOE, 2024, 2024 (12)
[32]   Covariance Matrix Adaptation for Multi-Agent Proximal Policy Optimization [J].
Shen, Yiou ;
Gao, Xiang ;
Liang, Zhiwei .
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, :4847-4852
[33]   Event-triggered formation control with obstacle avoidance for multi-agent systems applied to multi-UAV formation flying [J].
Han, Liang ;
Wang, Yue ;
Yan, Ziwei ;
Li, Xiaoduo ;
Ren, Zhang .
CONTROL ENGINEERING PRACTICE, 2024, 153
[34]   Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning [J].
Ji X. ;
Hai J. ;
Luo W. ;
Lin C. ;
Xiong Y. ;
Ou Z. ;
Wen J. .
Journal of Shanghai Jiaotong University (Science), 2021, 26 (05) :680-685
[35]   Consensus Formation of Multi-agent Systems with Obstacle Avoidance based on Event-triggered Impulsive Control [J].
Ji, Lianghao ;
Qu, Xiaofeng ;
Tang, Chengmei ;
Yang, Shasha ;
Guo, Xing ;
Li, Huaqing .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 109 (03)
[36]   Group motion control of multi-agent systems with obstacle avoidance: column formation under input constraints [J].
Niizuma, Naoaki ;
Nakano, Kazushi ;
Funato, Tetsuro ;
Higuchi, Kohji .
ARTIFICIAL LIFE AND ROBOTICS, 2015, 20 (01) :70-77
[37]   Formation Control and Obstacle Avoidance for Multi-Agent Systems Based on Virtual Leader-Follower Strategy [J].
Yan, Jing ;
Guan, Xinping ;
Luo, Xiaoyuan ;
Chen, Cailian .
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2017, 16 (03) :865-880
[38]   Consensus Formation of Multi-agent Systems with Obstacle Avoidance based on Event-triggered Impulsive Control [J].
Lianghao Ji ;
Xiaofeng Qu ;
Chengmei Tang ;
Shasha Yang ;
Xing Guo ;
Huaqing Li .
Journal of Intelligent & Robotic Systems, 2023, 109
[39]   The Effectiveness of State Representation Model in Multi-Agent Proximal Policy Optimization for Multi-Agent Path Finding [J].
Chung, Jaehoon ;
Fayyad, Jamil ;
Tamizi, Mehran Ghafarian ;
Najjaran, Homayoun .
2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024), 2024, :9947-9952
[40]   Velocity Obstacle Approaches for Multi-Agent Collision Avoidance [J].
Douthwaite, James A. ;
Zhao, Shiyu ;
Mihaylova, Lyudmila S. .
UNMANNED SYSTEMS, 2019, 7 (01) :55-64