Multi-Agent Formation Control With Obstacle Avoidance Using Proximal Policy Optimization

被引:7
作者
Sadhukhan, Priyam [1 ]
Selmic, Rastko R. [1 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
来源
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2021年
关键词
D O I
10.1109/SMC52423.2021.9658635
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a formation of second-order holonomic agents is made to navigate through an obstacle field using proximal policy optimization (PPO) based deep reinforcement learning (DRL). The angle-based formation is allowed to shrink while maintaining its shape in order to navigate through tight spaces and take the geometric centroid of the formation towards the goal. Two reward schemes are presented, one based on the actions of individual agents and another based on the actions of the team as a whole. For each case, all the agents share a single policy that is trained in a centralized manner. Distance measurements, state information, error information regarding neighboring agents, and simulation information are used for training each policy in an end-to-end fashion. Simulation results for both approaches are compared.
引用
收藏
页码:2694 / 2699
页数:6
相关论文
共 50 条
[21]   Formation Shaping Control for Multi-Agent Systems with Obstacle Avoidance and Dynamic Leader Selection [J].
Adderson, Ryan ;
Pan, Ya-Jun .
2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, :1082-1087
[22]   Finite-time formation control and obstacle avoidance of multi-agent system with application [J].
Shou, Yingxin ;
Xu, Bin ;
Lu, Haibo ;
Zhang, Aidong ;
Mei, Tao .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) :2883-2901
[23]   Distributed optimal control for multi-agent systems with obstacle avoidance [J].
Chen, Yuanqing ;
Sun, Jitao .
NEUROCOMPUTING, 2016, 173 :2014-2021
[24]   Distributed Flocking Control and Obstacle Avoidance for Multi-Agent Systems [J].
Liu Huagang ;
Fang Hao ;
Mao Yutian ;
Cao Hu ;
Jia Rui .
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, :4536-4541
[25]   Centralized Multi-Agent Proximal Policy Optimization with Attention [J].
Cazaux, Hugo ;
Rudd, Ralph ;
Stefansson, Hlynur ;
Olafsson, Sverrir ;
Asgeirsson, Eyjolfur Ingi .
2024 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2024, :834-840
[26]   Autonomous underwater vehicle formation control and obstacle avoidance using multi-agent generative adversarial imitation learning [J].
Fang, Zheng ;
Jiang, Dong ;
Huang, Jie ;
Cheng, Chunxi ;
Sha, Qixin ;
He, Bo ;
Li, Guangliang .
OCEAN ENGINEERING, 2022, 262
[27]   Autonomous underwater vehicle formation control and obstacle avoidance using multi-agent generative adversarial imitation learning [J].
Fang, Zheng ;
Jiang, Dong ;
Huang, Jie ;
Cheng, Chunxi ;
Sha, Qixin ;
He, Bo ;
Li, Guangliang .
Ocean Engineering, 2022, 262
[28]   Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning [J].
Yan, Yuzi ;
Li, Xiaoxiang ;
Qiu, Xinyou ;
Qiu, Jiantao ;
Wang, Jian ;
Wang, Yu ;
Shen, Yuan .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, :1661-1667
[29]   Swarm Multi-agent Trapping Multi-target Control with Obstacle Avoidance [J].
Li, Chenyang ;
Jiang, Guanjie ;
Yang, Yonghui ;
Chen, XueBo .
ADVANCES IN SWARM INTELLIGENCE, ICSI 2023, PT II, 2023, 13969 :49-61
[30]   Multi-Agent Ergodic Coverage with Obstacle Avoidance [J].
Salman, Hadi ;
Ayvali, Elif ;
Choset, Howie .
TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, :242-249