Joint UAV Placement Optimization, Resource Allocation, and Computation Offloading for THz Band: A DRL Approach

被引：40

作者：

Wang, Heng ^{[1
]}

Zhang, Haijun ^{[1
]}

Liu, Xiangnan ^{[1
]}

Long, Keping ^{[1
]}

Nallanathan, Arumugam ^{[2
]}

机构：

[1] Univ Sci & Technol Beijing, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing Engn & Technol Res Ctr Convergence Network, Beijing 100083, Peoples R China

[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 07期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Resource management; Task analysis; Servers; Optimization; Wireless communication; Heuristic algorithms; Delays; MEC; resource allocation; Index Terms; UAV; THz frequency band; DRL; INDUSTRIAL INTERNET; POWER OPTIMIZATION; NETWORKS; THINGS;

D O I：

10.1109/TWC.2022.3230407

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the development of internet of things, latency-sensitive applications such as telemedicine are constantly emerging. Unfortunately, due to the limited computation capacity of wireless user devices, the real-time demands can not be met. Multi-access edge computing (MEC), which enables the deployment of edge access points (E-APs) to support computation-intensive applications, has become an effective way to meet the real-time demands. However, the number of WUDs that E-APs can serve are limited. To increase system capacity, the unmanned aerial vehicle (UAV) assisted computation offloading architecture in the terahertz (THz) band is proposed. In this paper, the problem of UAV placement optimization, resource allocation, and computation offloading is investigated considering the quality of service and resource constraints. The joint optimization problem is non-convex and hard to be solved in time by using traditional algorithms, such as successive convex approximation. Therefore, deep reinforcement learning (DRL) based approach is a promising way to solve the formulated non-convex problem of minimizing latency. Double deep Q-learning (DDQN) and deep deterministic policy gradient (DDPG) algorithms are provided to search for near-optimal solutions in highly dynamic environments. The effectiveness of the proposed algorithms is proved by simulation results in different scenarios.

引用

页码：4890 / 4900

页数：11

共 32 条

[31] A Double Deep Q-Learning Model for Energy-Efficient Edge Scheduling [J].

Zhang, Qingchen ;

Lin, Man ;

Yang, Laurence T. ;

Chen, Zhikui ;

Khan, Samee U. ;

Li, Peng .

IEEE TRANSACTIONS ON SERVICES COMPUTING, 2019, 12 (05) :739-749

[32] Computation Rate Maximization in UAV-Enabled Wireless-Powered Mobile-Edge Computing Systems [J].

Zhou, Fuhui ;

Wu, Yongpeng ;

Hu, Rose Qingyang ;

Qian, Yi .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1927-1941

← 1 2 3 4 →