Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks

被引：34

作者：

Gong, Shimin ^{[1
]}

Wang, Meng ^{[1
]}

Gu, Bo ^{[1
]}

Zhang, Wenjie ^{[2
]}

Dinh Thai Hoang ^{[3
]}

Niyato, Dusit ^{[4
]}

机构：

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen 518063, Peoples R China

[2] Minnan Normal Univ, Sch Comp Sci, Fujian 363000, Peoples R China

[3] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia

[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2023年 / 72卷 / 08期

基金：

新加坡国家研究基金会; 美国国家科学基金会;

关键词：

UAV network; trajectory planning; network formation; Bayesian optimization; deep reinforcement learning; ASSISTED DATA-COLLECTION; COOPERATIVE INTERNET; DESIGN; COMMUNICATION; MINIMIZATION;

D O I：

10.1109/TVT.2023.3262778

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we employ multiple UAVs coordinated by a base station (BS) to help the ground users (GUs) to offload their sensing data. Different UAVs can adapt their trajectories and network formation to expedite data transmissions via multi-hop relaying. The trajectory planning aims to collect allGUs' data, while the UAVs' network formation optimizes the multi-hop UAV network topology to minimize the energy consumption and transmission delay. The joint network formation and trajectory optimization is solved by a two-step iterative approach. Firstly, we devise the adaptive network formation scheme by using a heuristic algorithm to balance the UAVs' energy consumption and data queue size. Then, with the fixed network formation, the UAVs' trajectories are further optimized by using multi-agent deep reinforcement learning without knowing the GUs' traffic demands and spatial distribution. To improve the learning efficiency, we further employ Bayesian optimization to estimate the UAVs' flying decisions based on historical trajectory points. This helps avoid inefficient action explorations and improves the convergence rate in the model training. The simulation results reveal close spatial-temporal couplings between the UAVs' trajectory planning and network formation. Compared with several baselines, our solution can better exploit the UAVs' cooperation in data offloading, thus improving energy efficiency and delay performance.

引用

页码：10933 / 10948

页数：16

共 42 条

[1] Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks [J].

Abd-Elmagid, Mohamed A. ;

Dhillon, Harpreet S. .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :2003-2008

[2] 3-D Placement of an Unmanned Aerial Vehicle Base Station for Maximum Coverage of Users With Different QoS Requirements [J].

Alzenad, Mohamed ;

El-Keyi, Amr ;

Yanikomeroglu, Halim .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2018, 7 (01) :38-41

[3] A UAV-Assisted Data Collection for Wireless Sensor Networks: Autonomous Navigation and Scheduling [J].

Bouhamed, Omar ;

Ghazzai, Hakim ;

Besbes, Hichem ;

Massoud, Yehia .

IEEE ACCESS, 2020, 8 :110446-110460

[4]

Brochu E., 2010, CoRR, V2599

[5]

Challita U, 2017, IEEE GLOB COMM CONF

[6] Joint Position and Resource Optimization for Multi-UAV-Aided Relaying Systems [J].

Chen, Qinbo .

IEEE ACCESS, 2020, 8 (08) :10403-10415

[7] Fast or Slow: An Autonomous Speed Control Approach for UAV-assisted IoT Data Collection Networks [J].

Chu, Nam H. ;

Dinh Thai Hoang ;

Nguyen, Diep N. ;

Nguyen Van Huynh ;

Dutkiewicz, Eryk .

2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,

[8] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].

Cui, Jingjing ;

Liu, Yuanwei ;

Nallanathan, Arumugam .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743

[9] Coarse Trajectory Design for Energy Minimization in UAV-Enabled [J].

Dinh-Hieu Tran ;

Vu, Thang X. ;

Chatzinotas, Symeon ;

ShahbazPanahi, Shahram ;

Ottersten, Bjorn .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (09) :9483-9496

[10] Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial [J].

Feriani, Amal ;

Hossain, Ekram .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (02) :1226-1252

← 1 2 3 4 5 →