Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Cited by: 14
Authors
Zhao, Xiaoru [1]
Yang, Rennong [1]
Zhong, Liangsheng [2]
Hou, Zhiwei [2]
Affiliations
[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian 710051, Peoples R China
[2] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510275, Peoples R China
Keywords
path planning; path follow; deep reinforcement learning; multi-UAV; parameter share;
DOI
10.3390/drones8010018
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
To meet the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing, off-policy, multi-agent approach to path planning and path following. Current multi-agent path planning relies predominantly on grid-based maps, whereas the proposed approach takes laser scan data as input, which is closer to real-world applications. In this approach, each unmanned aerial vehicle (UAV) uses the soft actor-critic (SAC) algorithm as its planner and trains its policy until convergence. The policy processes laser scan data end to end, guiding the UAV to avoid obstacles and reach the goal. The planner also takes points on paths generated by a sampling-based method as following points, which are updated continuously as the UAV progresses. Sharing experiences among agents facilitates multi-UAV path planning and accelerates policy convergence. To address the problem of UAVs remaining stationary at the start and being overly cautious near the goal, a reward function is designed to encourage movement. A multi-UAV simulation environment that mimics real-world UAV scenarios is also established to support training and validation of the approach. Simulation results demonstrate the effectiveness of the approach in both the training process and task performance: the algorithm achieves an 80% success rate in getting three UAVs to their goal points.
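The continuous update of following points described in the abstract can be pictured with a minimal sketch. It is not the authors' implementation: the function name `advance_following_point`, the `reach_radius` threshold, and the 2D waypoint path are all assumptions made for illustration, and the record does not say how the paper actually decides when to switch to the next point.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def advance_following_point(
    position: Point,
    path: List[Point],
    index: int,
    reach_radius: float = 0.5,  # hypothetical switching threshold
) -> int:
    """Return the index of the waypoint the UAV should follow next.

    `path` stands in for the output of a sampling-based planner; the
    last element is treated as the goal and is never skipped.
    """
    # Skip every waypoint that is already within reach of the UAV.
    while index < len(path) - 1 and math.dist(position, path[index]) <= reach_radius:
        index += 1
    return index


# Hypothetical straight-line path and a few UAV positions along it.
path = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
index = 0
trace = []
for position in [(0.0, 0.0), (0.4, 0.0), (0.8, 0.0), (1.9, 0.0), (3.0, 0.0)]:
    index = advance_following_point(position, path, index)
    trace.append(index)
```

In this sketch the following point only moves forward, so the planner always receives a target ahead of the UAV until the goal itself becomes the target.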
Pages: 18