Optimizing UAV-UGV coalition operations: A hybrid clustering and multi-agent reinforcement learning approach for path planning in obstructed environment

被引：9

作者：

Brotee, Shamyo ^{[1
]}

Kabir, Farhan ^{[1
]}

Razzaque, Md. Abdur ^{[1
]}

Roy, Palash ^{[2
]}

Mamun-Or-Rashid, Md. ^{[1
]}

Hassan, Md. Rafiul ^{[3
]}

Hassan, Mohammad Mehedi ^{[4
]}

机构：

[1] Univ Dhaka, Green Networking Res Grp, Dept Comp Sci & Engn, Dhaka, Bangladesh

[2] Green Univ Bangladesh, Dept Comp Sci & Engn, Dhaka, Bangladesh

[3] Univ Maine Presque Isle, Coll Arts & Sci, Presque Isle, ME 04769 USA

[4] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh, Saudi Arabia

来源：

AD HOC NETWORKS | 2024年 / 160卷

关键词：

UAV-UGV coalition; Path planning; Multi-agent deep reinforcement learning; Mean-shift clustering; Obstructed environment;

D O I：

10.1016/j.adhoc.2024.103519

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

One of the most critical applications undertaken by Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) is reaching predefined targets by following the most time-efficient routes while avoiding collisions. Unfortunately, UAVs are hampered by limited battery life, and UGVs face challenges in reachability due to obstacles and elevation variations, which is why a coalition of UAVs and UGVs can be highly effective. Existing literature primarily focuses on one-to-one coalitions, which constrains the efficiency of reaching targets. In this work, we introduce a novel approach for a UAV-UGV coalition with a variable number of vehicles, employing a modified mean-shift clustering algorithm (MEANCRFT) to segment targets into multiple zones. This approach of assigning targets to various circular zones based on density and range significantly reduces the time required to reach these targets. Moreover, introducing variability in the number of UAVs and UGVs in a coalition enhances task efficiency by enabling simultaneous multi-target engagement. In our approach, each vehicle of the coalitions is trained using two advanced deep reinforcement learning algorithms in two separate experiments, namely Multi-agent Deep Deterministic Policy Gradient (MADDPG) and Multi- agent Proximal Policy Optimization (MAPPO). The results of our experimental evaluation demonstrate that the proposed MEANCRFT-MADDPG method substantially surpasses current state-of-the-art techniques, nearly doubling efficiency in terms of target navigation time and task completion rate.

引用

页数：16

共 34 条

[1] Coordinated Target Assignment and UAV Path Planning with Timing Constraints [J].

Babel, Luitpold .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2019, 94 (3-4) :857-869

[2] A Clustering-Based Coverage Path Planning Method for Autonomous Heterogeneous UAVs [J].

Chen, Jinchao ;

Du, Chenglie ;

Zhang, Ying ;

Han, Pengcheng ;

Wei, Wei .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) :25546-25556

[3] An Adaptive Clustering-Based Algorithm for Automatic Path Planning of Heterogeneous UAVs [J].

Chen, Jinchao ;

Zhang, Ying ;

Wu, Lianwei ;

You, Tao ;

Ning, Xin .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) :16842-16853

[4] Multi-UAV Task Assignment With Parameter and Time-Sensitive Uncertainties Using Modified Two-Part Wolf Pack Search Algorithm [J].

Chen, Yongbo ;

Yang, Di ;

Yu, Jianqiao .

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2018, 54 (06) :2853-2872

[5] Multi-Agent Distributed Deep Deterministic Policy Gradient for Partially Observable Tracking [J].

Fan, Dongyu ;

Shen, Haikuo ;

Dong, Lijing .

ACTUATORS, 2021, 10 (10)

[6] Meta Proximal Policy Optimization for Cooperative Multi-Agent Continuous Control [J].

Fang, Boli ;

Peng, Zhenghao ;

Sun, Hao ;

Zhang, Qin .

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,

[7] Cooperative UAV Trajectory Design for Disaster Area Emergency Communications: A Multiagent PPO Method [J].

Guan, Yue ;

Zou, Sai ;

Peng, Haixia ;

Ni, Wei ;

Sun, Yanglong ;

Gao, Hongfeng .

IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05) :8848-8859

[8]

Hu J, 2022, ADV NEUR IN

[9] Deep Reinforcement Learning for Safe Local Planning of a Ground Vehicle in Unknown Rough Terrain [J].

Josef, Shirel ;

Degani, Amir .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) :6748-6755

[10] Autonomous Shepherding Behaviors of Multiple Target Steering Robots [J].

Lee, Wonki ;

Kim, DaeEun .

SENSORS, 2017, 17 (12)

← 1 2 3 4 →