Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks

被引:25
|
作者
Gong, Shimin [1 ]
Wang, Meng [1 ]
Gu, Bo [1 ]
Zhang, Wenjie [2 ]
Dinh Thai Hoang [3 ]
Niyato, Dusit [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen 518063, Peoples R China
[2] Minnan Normal Univ, Sch Comp Sci, Fujian 363000, Peoples R China
[3] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
UAV network; trajectory planning; network formation; Bayesian optimization; deep reinforcement learning; ASSISTED DATA-COLLECTION; COOPERATIVE INTERNET; DESIGN; COMMUNICATION; MINIMIZATION;
D O I
10.1109/TVT.2023.3262778
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we employ multiple UAVs coordinated by a base station (BS) to help the ground users (GUs) to offload their sensing data. Different UAVs can adapt their trajectories and network formation to expedite data transmissions via multi-hop relaying. The trajectory planning aims to collect allGUs' data, while the UAVs' network formation optimizes the multi-hop UAV network topology to minimize the energy consumption and transmission delay. The joint network formation and trajectory optimization is solved by a two-step iterative approach. Firstly, we devise the adaptive network formation scheme by using a heuristic algorithm to balance the UAVs' energy consumption and data queue size. Then, with the fixed network formation, the UAVs' trajectories are further optimized by using multi-agent deep reinforcement learning without knowing the GUs' traffic demands and spatial distribution. To improve the learning efficiency, we further employ Bayesian optimization to estimate the UAVs' flying decisions based on historical trajectory points. This helps avoid inefficient action explorations and improves the convergence rate in the model training. The simulation results reveal close spatial-temporal couplings between the UAVs' trajectory planning and network formation. Compared with several baselines, our solution can better exploit the UAVs' cooperation in data offloading, thus improving energy efficiency and delay performance.
引用
收藏
页码:10933 / 10948
页数:16
相关论文
共 50 条
  • [21] Multi-Agent Deep Reinforcement Learning for Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2232 - 2237
  • [22] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks
    Chen, Peixin
    Zhao, Jian
    Shen, Furao
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 933 - 938
  • [23] Throughput Maximization in NOMA Enhanced RIS-Assisted Multi-UAV Networks: A Deep Reinforcement Learning Approach
    Tang, Runzhi
    Wang, Junxuan
    Zhang, Yanyan
    Jiang, Fan
    Zhang, Xuewei
    Du, Jianbo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 730 - 745
  • [24] Distributed Trajectory Optimization Method for Multi-UAV Formation Rendezvous over Directed Networks
    Han, Dongyu
    Ma, Aoyun
    Zhang, Yongpo
    Liu, Kun
    Xia, Yuanqing
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6206 - 6211
  • [25] Distributed multi-UAV trajectory optimization over directed networks
    Liu, Tao
    Han, Dongyu
    Lin, Yeming
    Liu, Kun
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2021, 358 (10): : 5470 - 5487
  • [26] Trajectory and Beamforming Vector Optimization for Multi-UAV Multicast Network
    Feng, Chengcheng
    Zhang, Chao
    Luo, Xinmin
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [27] Multi-UAV Dynamic Wireless Networking With Deep Reinforcement Learning
    Wang, Qiang
    Zhang, Wenqi
    Liu, Yuanwei
    Liu, Ying
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) : 2243 - 2246
  • [28] Multi-UAV Assisted Network Coverage Optimization for Rescue Operations using Reinforcement Learning
    Oubbati, Omar Sami
    Badis, Hakim
    Rachedi, Abderrezak
    Lakas, Abderrahmane
    Lorenz, Pascal
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
  • [29] Deep reinforcement learning based trajectory design and resource allocation for task-aware multi-UAV enabled MEC networks
    Li, Zewu
    Xu, Chen
    Zhang, Zhanpeng
    Wu, Runze
    COMPUTER COMMUNICATIONS, 2024, 213 : 88 - 98
  • [30] Energy-Efficient Multi-UAV Network using Multi-Agent Deep Reinforcement Learning
    Ju, Hyungyu
    Shim, Byonghyo
    2022 IEEE VTS ASIA PACIFIC WIRELESS COMMUNICATIONS SYMPOSIUM, APWCS, 2022, : 70 - 74