The LSTM-Based Advantage Actor-Critic Learning for Resource Management in Network Slicing With User Mobility

被引:92
作者
Li, Rongpeng [1 ]
Wang, Chujie [2 ,3 ]
Zhao, Zhifeng [4 ]
Guo, Rongbin [4 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Dept Informat Sci & Elect Engn ISEE, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Dept ISEE, Hangzhou 310027, Peoples R China
[3] Huawei, Shanghai 200120, Peoples R China
[4] Zhejiang Lab, Hangzhou 311121, Peoples R China
关键词
Resource management; Bandwidth; Feature extraction; Network slicing; Heuristic algorithms; Base stations; Reinforcement learning; network slicing; deep reinforcement learning; long short-term memory (LSTM); advantage actor critic (A2C); user mobility; 5G;
D O I
10.1109/LCOMM.2020.3001227
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Network slicing aims to efficiently provision diversified services with distinct requirements over the same physical infrastructure. Therein, in order to efficiently allocate resources across slices, demand-aware inter-slice resource management is of significant importance. In this letter, we consider a scenario that contains several slices in a radio access network with base stations that share the same physical resources (e.g., bandwidth or slots). We primarily leverage advantage actor-critic (A2C), one typical deep reinforcement learning (DRL) algorithm, to solve this problem by considering the varying service demands as the environment state and the allocated resources as the environment action. However, given that the user mobility toughens the difficulty to perceive the environment, we further incorporate the long short-term memory (LSTM) into A2C, and put forward an LSTM-A2C algorithm to track the user mobility and improve the system utility. We verify the performance of the proposed LSTM-A2C through extensive simulations.
引用
收藏
页码:2005 / 2009
页数:5
相关论文
共 16 条
  • [1] 3GPP, 2017, 22261 TS 3GPP
  • [2] [Anonymous], 2015, 36814 TR 3GPP
  • [3] 5G RAN Slicing for Verticals: Enablers and Challenges
    Elayoubi, Salah Eddine
    Ben Jemaa, Sana
    Altman, Zwi
    Galindo-Serrano, Ana
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (01) : 28 - 34
  • [4] LSTM: A Search Space Odyssey
    Greff, Klaus
    Srivastava, Rupesh K.
    Koutnik, Jan
    Steunebrink, Bas R.
    Schmidhuber, Juergen
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) : 2222 - 2232
  • [5] GAN-Powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing
    Hua, Yuxiu
    Li, Rongpeng
    Zhao, Zhifeng
    Chen, Xianfu
    Zhang, Honggang
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (02) : 334 - 349
  • [6] Deep Learning with Long Short-Term Memory for Time Series Prediction
    Hua, Yuxiu
    Zhao, Zhifeng
    Li, Rongpeng
    Chen, Xianfu
    Liu, Zhiming
    Zhang, Honggang
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (06) : 114 - 119
  • [7] Deep Reinforcement Learning for Resource Management in Network Slicing
    Li, Rongpeng
    Zhao, Zhifeng
    Sun, Qi
    I, Chih-Lin
    Yang, Chenyang
    Chen, Xianfu
    Zhao, Minjian
    Zhang, Honggang
    [J]. IEEE ACCESS, 2018, 6 : 74429 - 74441
  • [8] INTELLIGENT 5G: WHEN CELLULAR NETWORKS MEET ARTIFICIAL INTELLIGENCE
    Li, Rongpeng
    Zhao, Zhifeng
    Zhou, Xuan
    Ding, Guoru
    Chen, Yan
    Wang, Zhongyao
    Zhang, Honggang
    [J]. IEEE WIRELESS COMMUNICATIONS, 2017, 24 (05) : 175 - 183
  • [9] Deep Reinforcement Learning With Discrete Normalized Advantage Functions for Resource Management in Network Slicing
    Qi, Chen
    Hua, Yuxiu
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    [J]. IEEE COMMUNICATIONS LETTERS, 2019, 23 (08) : 1337 - 1341
  • [10] Rost P., 2019, U.S. Patent, Patent No. [20 190 159 024 A1, 20190159024]