GAN-powered heterogeneous multi-agent reinforcement learning for UAV-assisted task

被引:7
作者
Li, Yangyang [1 ]
Feng, Lei [1 ]
Yang, Yang [1 ]
Li, Wenjing [1 ]
机构
[1] BUPT, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicle; Task offloading; Multi-agent reinforcement learning; Heterogeneous agents; Generative adversarial network; RESOURCE-ALLOCATION; NETWORKS; MEC;
D O I
10.1016/j.adhoc.2023.103341
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The flexible and highly mobile unmanned aerial vehicle (UAV) with computing capabilities can improve the quality of experience (QoE) of ground users (GUs) according to real-time service requirements by performing flight maneuvers. In this study, we investigate a task offloading scheme and trajectory optimization in a multi-UAV-assisted system, where UAVs offload a portion of multiple GUs' computational tasks. The optimization problem is formulated to jointly minimize the energy consumption of UAVs and the task latency of GUs by optimizing trajectory, task allocation and offloading proportion. This paper proposes a heterogeneous multi-agent reinforcement learning (MARL)-based approach to solve the issue in high dimensions and limited states, where UAV and GU are treated separately as two different types of agents. Due to the high cost and low sample efficiency of online training of RL algorithms, a generative adversarial network (GAN)-powered auxiliary training mechanism is proposed, which reduces the overhead of interacting with the real world and makes the agent's policy appropriate for real-world execution environment via offline training with generated environment states. Numerical evaluation results demonstrate that the proposed algorithm outperforms other benchmark algorithms in terms of energy consumption and task latency.
引用
收藏
页数:14
相关论文
共 31 条
[1]   Optimal Trajectory Learning for UAV-Mounted Mobile Base Stations using RL and Greedy Algorithms [J].
Bhandarkar, Adhitya Bantwal ;
Jayaweera, Sudharman K. .
2021 17TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2021), 2021, :13-18
[2]  
Brik B, 2021, IEEE NETWORK, V35, P74, DOI [10.1109/MNET.011.2000577, 10.1109/MNET.011.2000587]
[3]   Energy-Efficient Task Offloading and Resource Allocation via Deep Reinforcement Learning for Augmented Reality in Mobile Edge Networks [J].
Chen, Xing ;
Liu, Guizhong .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (13) :10843-10856
[4]   UAV Trajectory Optimization for Data Offloading at the Edge of Multiple Cells [J].
Cheng, Fen ;
Zhang, Shun ;
Li, Zan ;
Chen, Yunfei ;
Zhao, Nan ;
Yu, F. Richard ;
Leung, Victor C. M. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (07) :6732-6736
[5]   Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].
Cui, Jingjing ;
Liu, Yuanwei ;
Nallanathan, Arumugam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743
[6]   UAV-Relaying-Assisted Multi-Access Edge Computing With Multi-Antenna Base Station: Offloading and Scheduling Optimization [J].
Diao, Xianbang ;
Yang, Wendong ;
Yang, Lianxin ;
Cai, Yueming .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) :9495-9509
[7]   Optimal Node Placement and Resource Allocation for UAV Relaying Network [J].
Fan, Rongfei ;
Cui, Jiannan ;
Jin, Song ;
Yang, Kai ;
An, Jianping .
IEEE COMMUNICATIONS LETTERS, 2018, 22 (04) :808-811
[8]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[9]   Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading [J].
Gao, Ang ;
Wang, Qi ;
Liang, Wei ;
Ding, Zhiguo .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) :12888-12901
[10]   Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks [J].
Gong, Shimin ;
Wang, Meng ;
Gu, Bo ;
Zhang, Wenjie ;
Dinh Thai Hoang ;
Niyato, Dusit .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) :10933-10948