Optimization of Urban Target Area Accessibility for Multi-UAV Data Gathering Based on Deep Reinforcement Learning

被引：0

作者：

Jin, Zhengmiao ^{[1
]}

Chen, Renxiang ^{[2
]}

Wu, Ke ^{[1
]}

Yu, Tengwei ^{[2
]}

Fu, Linghua ^{[1
]}

机构：

[1] Chongqing Jiaotong Univ, Sch Aeronaut, Chongqing 404100, Peoples R China

[2] Chongqing Jiaotong Univ, Sch Mechatron & Vehicle Engn, Chongqing, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 09期

基金：

中国国家自然科学基金;

关键词：

multi-UAV; data gathering; path planning; reinforcement learning (RL); exploration;

D O I：

10.3390/drones8090462

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Unmanned aerial vehicles (UAVs) are increasingly deployed to enhance the operational efficiency of city services. However, finding optimal solutions for the gather-return task pattern under dynamic environments and the energy constraints of UAVs remains a challenge, particularly in dense high-rise building areas. This paper investigates the multi-UAV path planning problem, aiming to optimize solutions and enhance data gathering rates by refining exploration strategies. Initially, for the path planning problem, a reinforcement learning (RL) technique equipped with an environment reset strategy is adopted, and the data gathering problem is modeled as a maximization problem. Subsequently, to address the limitations of stationary distribution in indicating the short-term behavioral patterns of agents, a Time-Adaptive Distribution is proposed, which evaluates and optimizes the policy by combining the behavioral characteristics of agents across different time scales. This approach is particularly suitable for the early stages of learning. Furthermore, the paper describes and defines the "Narrow-Elongated Path" Problem (NEP-Problem), a special spatial configuration in RL environments that hinders agents from finding optimal solutions through random exploration. To address this, a Robust-Optimization Exploration Strategy is introduced, leveraging expert knowledge and robust optimization to ensure UAVs can deterministically reach and thoroughly explore any target areas. Finally, extensive simulation experiments validate the effectiveness of the proposed path planning algorithms and comprehensively analyze the impact of different exploration strategies on data gathering efficiency.

引用

页数：37

共 50 条

[21] Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints
Zhou, Yating
Shi, Dianxi
Yang, Huanhuan
Hu, Haomeng
Yang, Shaowu
Zhang, Yongjun
COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 363 - 379
[22] Multi-UAV trajectory optimizer: A sustainable system for wireless data harvesting with deep reinforcement learning
Seong, Mincheol
Jo, Ohyun
Shin, Kyungseop
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[23] Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method
Liao, Guang
Wang, Jian
Yang, Dujia
Yang, Junan
SENSORS, 2024, 24 (21)
[24] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
Guven, Islam
Yanmaz, Evsen
PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
[25] Multi-UAV Reinforcement Learning for Data Collection in Cellular MIMO Networks
Diaz-Vilor, Carles
Abdelhady, Amr M.
Eltawil, Ahmed M.
Jafarkhani, Hamid
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 15462 - 15476
[26] Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks
Gong, Shimin
Wang, Meng
Gu, Bo
Zhang, Wenjie
Dinh Thai Hoang
Niyato, Dusit
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) : 10933 - 10948
[27] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
Rezwan, Sifat
Chun, Chanjun
Choi, Wooyeol
IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732
[28] Joint Multi-UAV Deployment and Resource Allocation based on Personalized Federated Deep Reinforcement Learning
Xu, Xinyi
Feng, Gang
Qin, Shuang
Liu, Yijing
Sun, Yao
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5677 - 5682
[29] Multi-UAV Collaborative Surveillance Network Recovery via Deep Reinforcement Learning
Zhang, Jingbin
Wang, Tao
Wang, Jingjing
Du, Wenbo
Zheng, Dezhi
Wang, Shuai
Li, Yumeng
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (21): : 34528 - 34540
[30] Autonomous target tracking of multi-UAV: A two-stage deep reinforcement learning approach with expert experience
Wang, Jiahua
Zhang, Ping
Wang, Yang
APPLIED SOFT COMPUTING, 2023, 145

← 1 2 3 4 5 →