Optimization of Urban Target Area Accessibility for Multi-UAV Data Gathering Based on Deep Reinforcement Learning

被引:0
|
作者
Jin, Zhengmiao [1 ]
Chen, Renxiang [2 ]
Wu, Ke [1 ]
Yu, Tengwei [2 ]
Fu, Linghua [1 ]
机构
[1] Chongqing Jiaotong Univ, Sch Aeronaut, Chongqing 404100, Peoples R China
[2] Chongqing Jiaotong Univ, Sch Mechatron & Vehicle Engn, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-UAV; data gathering; path planning; reinforcement learning (RL); exploration;
D O I
10.3390/drones8090462
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Unmanned aerial vehicles (UAVs) are increasingly deployed to enhance the operational efficiency of city services. However, finding optimal solutions for the gather-return task pattern under dynamic environments and the energy constraints of UAVs remains a challenge, particularly in dense high-rise building areas. This paper investigates the multi-UAV path planning problem, aiming to optimize solutions and enhance data gathering rates by refining exploration strategies. Initially, for the path planning problem, a reinforcement learning (RL) technique equipped with an environment reset strategy is adopted, and the data gathering problem is modeled as a maximization problem. Subsequently, to address the limitations of stationary distribution in indicating the short-term behavioral patterns of agents, a Time-Adaptive Distribution is proposed, which evaluates and optimizes the policy by combining the behavioral characteristics of agents across different time scales. This approach is particularly suitable for the early stages of learning. Furthermore, the paper describes and defines the "Narrow-Elongated Path" Problem (NEP-Problem), a special spatial configuration in RL environments that hinders agents from finding optimal solutions through random exploration. To address this, a Robust-Optimization Exploration Strategy is introduced, leveraging expert knowledge and robust optimization to ensure UAVs can deterministically reach and thoroughly explore any target areas. Finally, extensive simulation experiments validate the effectiveness of the proposed path planning algorithms and comprehensively analyze the impact of different exploration strategies on data gathering efficiency.
引用
收藏
页数:37
相关论文
共 50 条
  • [21] Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints
    Zhou, Yating
    Shi, Dianxi
    Yang, Huanhuan
    Hu, Haomeng
    Yang, Shaowu
    Zhang, Yongjun
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 363 - 379
  • [22] Multi-UAV trajectory optimizer: A sustainable system for wireless data harvesting with deep reinforcement learning
    Seong, Mincheol
    Jo, Ohyun
    Shin, Kyungseop
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [23] Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method
    Liao, Guang
    Wang, Jian
    Yang, Dujia
    Yang, Junan
    SENSORS, 2024, 24 (21)
  • [24] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
    Guven, Islam
    Yanmaz, Evsen
    PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
  • [25] Multi-UAV Reinforcement Learning for Data Collection in Cellular MIMO Networks
    Diaz-Vilor, Carles
    Abdelhady, Amr M.
    Eltawil, Ahmed M.
    Jafarkhani, Hamid
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 15462 - 15476
  • [26] Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks
    Gong, Shimin
    Wang, Meng
    Gu, Bo
    Zhang, Wenjie
    Dinh Thai Hoang
    Niyato, Dusit
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) : 10933 - 10948
  • [27] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
    Rezwan, Sifat
    Chun, Chanjun
    Choi, Wooyeol
    IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732
  • [28] Joint Multi-UAV Deployment and Resource Allocation based on Personalized Federated Deep Reinforcement Learning
    Xu, Xinyi
    Feng, Gang
    Qin, Shuang
    Liu, Yijing
    Sun, Yao
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5677 - 5682
  • [29] Multi-UAV Collaborative Surveillance Network Recovery via Deep Reinforcement Learning
    Zhang, Jingbin
    Wang, Tao
    Wang, Jingjing
    Du, Wenbo
    Zheng, Dezhi
    Wang, Shuai
    Li, Yumeng
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (21): : 34528 - 34540
  • [30] Autonomous target tracking of multi-UAV: A two-stage deep reinforcement learning approach with expert experience
    Wang, Jiahua
    Zhang, Ping
    Wang, Yang
    APPLIED SOFT COMPUTING, 2023, 145