Multi-Agent Deep Reinforcement Learning Based UAV Trajectory Optimization for Differentiated Services

Cited by: 38

Authors:

Ning, Zhaolong [1]

Yang, Yuxuan [2]

Wang, Xiaojie [1]

Song, Qingyang [1]

Guo, Lei [1]

Jamalipour, Abbas [2]

Affiliations:

[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China

[2] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2050, Australia

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024 / Vol. 23 / No. 05

Keywords:

Autonomous aerial vehicles; Servers; Computational efficiency; Task analysis; Trajectory optimization; Resource management; Costs; Multi-access edge computing; UAV-assisted communications; game theory; multi-agent DRL; RESOURCE-ALLOCATION; TASK;

DOI:

10.1109/TMC.2023.3312276

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Driven by the increasing computational demand of real-time mobile applications, Unmanned Aerial Vehicle (UAV) assisted Multi-access Edge Computing (MEC) has been envisioned as a promising paradigm for pushing computational resources to network edges and constructing high-throughput line-of-sight links for ground users. Most existing studies consider simplified scenarios, such as a single UAV, Service Provider (SP), or service type, and centralized UAV trajectory control. To be more in line with real-world cases, we aim to achieve distributed trajectory control of multiple UAVs in UAV-assisted MEC networks with multiple SPs providing differentiated services. Our objective is to simultaneously minimize the short-term computational costs of ground users and the long-term computational cost of UAVs based on incomplete information. We first solve the formulated problem by reaching the Nash Equilibrium (NE) of the game among SPs based on complete information. We further formulate a Markov game model and propose a Deep Reinforcement Learning (DRL)-based UAV trajectory optimization algorithm, where only local observations of each UAV are required for each SP's flying action execution. Theoretical analysis and performance evaluation demonstrate the convergence, efficiency, scalability, and robustness of our algorithm compared with other representative algorithms.

Pages: 5818-5834

Page count: 17
