Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization

Cited by: 36

Authors:

Yuan, Yaxiong [1]

Lei, Lei [1]

Vu, Thang X. [1]

Chatzinotas, Symeon [1]

Sun, Sumei [2]

Ottersten, Bjorn [1]

Affiliations:

[1] Luxembourg Univ, Interdisciplinary Ctr Secur Reliabil & Trust, L-1855 Kirchberg, Luxembourg

[2] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2021 / Vol. 70 / No. 05

Keywords:

Optimization; Trajectory; Heuristic algorithms; Unmanned aerial vehicles; Resource management; Propulsion; Task analysis; UAV; deep reinforcement learning; user scheduling; hovering time allocation; energy optimization; actor-critic; TRAJECTORY OPTIMIZATION; RESOURCE-ALLOCATION; FAIR COMMUNICATION; EFFICIENT; CHANNEL; DESIGN; RADIO;

DOI:

10.1109/TVT.2021.3075860

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving scheduling solutions. In this paper, we investigate energy minimization for UAV-aided communication networks by jointly optimizing data-transmission scheduling and UAV hovering time. The formulated problem is combinatorial and non-convex with bilinear constraints. To tackle the problem, we first provide an optimal algorithm (OPT) and a golden section search heuristic algorithm (GSS-HEU). Both solutions serve as offline performance benchmarks but might not be suitable for online operations. To this end, from a deep reinforcement learning (DRL) perspective, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm and develop a set of approaches to confine the action space. Compared to conventional RL/DRL, the novelty of AC-DSOS lies in handling two major issues, i.e., an exponentially increased action space and infeasible actions. Numerical results show that AC-DSOS is able to provide feasible solutions and saves around 25-30% energy compared to two conventional deep AC-DRL algorithms. Compared to the developed GSS-HEU, AC-DSOS consumes around 10% more energy but reduces the computational time from the second level to the millisecond level.
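The abstract names golden section search as the basis of the GSS-HEU benchmark but does not describe how it is applied. As a rough illustration only, the sketch below shows the generic textbook golden section search for minimizing a unimodal function of one variable. The function `toy_energy`, its coefficients, and the idea of searching over a single hovering-time value are assumptions made for this example and are not taken from the paper.

```python
import math


def golden_section_search(f, lo, hi, tol=1e-6):
    """Return an approximate minimizer of a unimodal function f on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2  # 1/phi, about 0.618
    a, b = lo, hi
    # Two interior probe points that split [a, b] in the golden ratio.
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc < fd:
            # Minimum lies in [a, d]; the old c becomes the new d.
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = f(c)
        else:
            # Minimum lies in [c, b]; the old d becomes the new c.
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = f(d)
    return (a + b) / 2


def toy_energy(t):
    """Hypothetical energy curve (not from the paper): one term falls with
    hovering time t, the other grows linearly with it."""
    return 50.0 / t + 2.0 * t


# Search a hypothetical hovering-time interval for the toy minimum.
t_star = golden_section_search(toy_energy, 0.5, 20.0)
```

Each iteration reuses one of the two previous function evaluations, so the search interval shrinks by a constant factor per new evaluation. The method assumes the objective is unimodal on the interval, which holds for the toy curve but would need to be checked for any real energy model.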


Pages: 5028-5042

Page count: 15
