Computing Over the Sky: Joint UAV Trajectory and Task Offloading Scheme Based on Optimization-Embedding Multi-Agent Deep Reinforcement Learning

被引：8

作者：

Li, Xuanheng ^{[1
]}

Du, Xinyang ^{[1
]}

Zhao, Nan ^{[1
]}

Wang, Xianbin ^{[2
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China

[2] Western Univ, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada

来源：

IEEE TRANSACTIONS ON COMMUNICATIONS | 2024年 / 72卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Autonomous aerial vehicles; Task analysis; Trajectory; Heuristic algorithms; Delays; Reinforcement learning; Resource management; Unmanned aerial vehicle; mobile edge computing; computation offloading; trajectory control; reinforcement learning; RESOURCE-ALLOCATION; ENERGY EFFICIENCY; ALGORITHM; NETWORKS; TIME;

D O I：

10.1109/TCOMM.2023.3331029

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) has emerged to support computation-intensive tasks in 6G systems. Since the battery capacity of a UAV is limited, to serve as many users as possible, a joint design on UAV trajectory and offloading strategy with consideration for service fairness is essential to provide energy-efficient computation offloading to the users in UAV-MEC networks. Unfortunately, such a joint decision-making problem is not straightforward due to various task types required from users and various functionalities of different UAVs enabled by different application programs. Considering the above issues, we take energy efficiency and service fairness as the objective, and propose a Multi-Agent Energy-Efficient joint Trajectory and Computation Offloading (MA-ETCO) scheme. To adapt to dynamic demands of users, we develop an optimization-embedding multi-agent deep reinforcement learning (OMADRL) algorithm. Each UAV autonomously learns the trajectory control decision based on MADRL to adapt to dynamic demands. Then, it will obtain the optimal computation offloading decision by solving a mixed-integer nonlinear programming problem. The computation offloading result, in turn, will be used as an indicator to guide UAVs' trajectory design. Compared to relying solely on deep reinforcement learning, such an optimization-embedding way reduces action space dimension and improves convergence efficiency.

引用

页码：1355 / 1369

页数：15

共 55 条

[31] User Mobility-Aware Time Stamp for UAV-BS Placement [J].

Peer, Mansi ;

Bohara, Vivek Ashok ;

Srivastava, Anand ;

Ghatak, Gourab .

2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2021,

[32] Opti-U: Optimal UAV Selection for Enabling UAV-as-a-Service [J].

Roy, Arijit ;

Bouvry, Pascal .

IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,

[33] Collaborative Computation Offloading and Resource Allocation in Multi-UAV-Assisted IoT Networks: A Deep Reinforcement Learning Approach [J].

Seid, Abegaz Mohammed ;

Boateng, Gordon Owusu ;

Anokye, Stephen ;

Kwantwi, Thomas ;

Sun, Guolin ;

Liu, Guisong .

IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15) :12203-12218

[34] Optimizing Multi-UAV Deployment in 3-D Space to Minimize Task Completion Time in UAV-Enabled Mobile Edge Computing Systems [J].

Sun, Sujunjie ;

Zhang, Guopeng ;

Mei, Haibo ;

Wang, Kezhi ;

Yang, Kun .

IEEE COMMUNICATIONS LETTERS, 2021, 25 (02) :579-583

[35] Toward Big Data Processing in IoT: Path Planning and Resource Management of UAV Base Stations in Mobile-Edge Computing System [J].

Wan, Shuo ;

Lu, Jiaxun ;

Fan, Pingyi ;

Letaief, Khaled B. .

IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :5995-6009

[36] Unmanned-Aerial-Vehicle-Assisted Computation Offloading for Mobile Edge Computing Based on Deep Reinforcement Learning [J].

Wang, Hui ;

Ke, Hongchang ;

Sun, Weijia .

IEEE ACCESS, 2020, 8 :180784-180798

[37] Deep Reinforcement Learning-Based Resource Management for Flexible Mobile Edge Computing Architectures, Applications, and Research Issues [J].

Wang, Kezhi ;

Wang, Liang ;

Pan, Cunhua ;

Ren, Hong .

IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2022, 17 (02) :85-93

[38] Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing [J].

Wang, Liang ;

Wang, Kezhi ;

Pan, Cunhua ;

Xu, Wei ;

Aslam, Nauman ;

Hanzo, Lajos .

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) :73-84

[39] Dynamic Resource Scheduling in Mobile Edge Cloud with Cloud Radio Access Network [J].

Wang, Xinhou ;

Wang, Kezhi ;

Wu, Song ;

Di, Sheng ;

Jin, Hai ;

Yang, Kun ;

Ou, Shumao .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (11) :2429-2445

[40] Feasibility Study of UAV-Assisted Anti-Jamming Positioning [J].

Wang, Zijie ;

Liu, Rongke ;

Liu, Qirui ;

Han, Lincong ;

Thompson, John S. .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) :7718-7733

← 1 2 3 4 5 6 →