The LSTM-Based Advantage Actor-Critic Learning for Resource Management in Network Slicing With User Mobility

被引：96

作者：

Li, Rongpeng ^{[1
]}

Wang, Chujie ^{[2
,3
]}

Zhao, Zhifeng ^{[4
]}

Guo, Rongbin ^{[4
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Dept Informat Sci & Elect Engn ISEE, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Dept ISEE, Hangzhou 310027, Peoples R China

[3] Huawei, Shanghai 200120, Peoples R China

[4] Zhejiang Lab, Hangzhou 311121, Peoples R China

来源：

IEEE COMMUNICATIONS LETTERS | 2020年 / 24卷 / 09期

关键词：

Resource management; Bandwidth; Feature extraction; Network slicing; Heuristic algorithms; Base stations; Reinforcement learning; network slicing; deep reinforcement learning; long short-term memory (LSTM); advantage actor critic (A2C); user mobility; 5G;

D O I：

10.1109/LCOMM.2020.3001227

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Network slicing aims to efficiently provision diversified services with distinct requirements over the same physical infrastructure. Therein, in order to efficiently allocate resources across slices, demand-aware inter-slice resource management is of significant importance. In this letter, we consider a scenario that contains several slices in a radio access network with base stations that share the same physical resources (e.g., bandwidth or slots). We primarily leverage advantage actor-critic (A2C), one typical deep reinforcement learning (DRL) algorithm, to solve this problem by considering the varying service demands as the environment state and the allocated resources as the environment action. However, given that the user mobility toughens the difficulty to perceive the environment, we further incorporate the long short-term memory (LSTM) into A2C, and put forward an LSTM-A2C algorithm to track the user mobility and improve the system utility. We verify the performance of the proposed LSTM-A2C through extensive simulations.

引用

页码：2005 / 2009

页数：5

共 16 条

[1]

[Anonymous], 2015, 36814 TR 3GPP

[2]

[Anonymous], 2017, 3GPP TS 22.261 V16.1.0

[3] 5G RAN Slicing for Verticals: Enablers and Challenges [J].

Elayoubi, Salah Eddine ;

Ben Jemaa, Sana ;

Altman, Zwi ;

Galindo-Serrano, Ana .

IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (01) :28-34

[4] LSTM: A Search Space Odyssey [J].

Greff, Klaus ;

Srivastava, Rupesh K. ;

Koutnik, Jan ;

Steunebrink, Bas R. ;

Schmidhuber, Juergen .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) :2222-2232

[5] GAN-Powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing [J].

Hua, Yuxiu ;

Li, Rongpeng ;

Zhao, Zhifeng ;

Chen, Xianfu ;

Zhang, Honggang .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (02) :334-349

[6] Deep Learning with Long Short-Term Memory for Time Series Prediction [J].

Hua, Yuxiu ;

Zhao, Zhifeng ;

Li, Rongpeng ;

Chen, Xianfu ;

Liu, Zhiming ;

Zhang, Honggang .

IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (06) :114-119

[7] Deep Reinforcement Learning for Resource Management in Network Slicing [J].

Li, Rongpeng ;

Zhao, Zhifeng ;

Sun, Qi ;

I, Chih-Lin ;

Yang, Chenyang ;

Chen, Xianfu ;

Zhao, Minjian ;

Zhang, Honggang .

IEEE ACCESS, 2018, 6 :74429-74441

[8] INTELLIGENT 5G: WHEN CELLULAR NETWORKS MEET ARTIFICIAL INTELLIGENCE [J].

Li, Rongpeng ;

Zhao, Zhifeng ;

Zhou, Xuan ;

Ding, Guoru ;

Chen, Yan ;

Wang, Zhongyao ;

Zhang, Honggang .

IEEE WIRELESS COMMUNICATIONS, 2017, 24 (05) :175-183

[9] Deep Reinforcement Learning With Discrete Normalized Advantage Functions for Resource Management in Network Slicing [J].

Qi, Chen ;

Hua, Yuxiu ;

Li, Rongpeng ;

Zhao, Zhifeng ;

Zhang, Honggang .

IEEE COMMUNICATIONS LETTERS, 2019, 23 (08) :1337-1341

[10]

Rost P., 2019, U.S. Patent, Patent No. [20 190 159 024 A1, 20190159024]

← 1 2 →