A Model-Driven Deep Reinforcement Learning Heuristic Algorithm for Resource Allocation in Ultra-Dense Cellular Networks

被引:48
作者
Liao, Xiaomin [1 ,2 ]
Shi, Jia [1 ]
Li, Zan [1 ]
Zhang, Lei [3 ]
Xia, Baiqiang [4 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Natl Univ Def Technol, Sch Informat & Commun, Xian 710106, Peoples R China
[3] Univ Glasgow, Sch Engn, Glasgow G12 8QQ, Lanark, Scotland
[4] Silo AI Oy, Helsinki 00180, Finland
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Ultra-dense cellular networks; resource allocation; deep reinforcement learning; model-driven; optimization; ENERGY EFFICIENCY; USER ASSOCIATION; SCHEME; OPTIMIZATION; MANAGEMENT; FAIRNESS;
D O I
10.1109/TVT.2019.2954538
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Resource allocation in ultra dense network (UDN) is an multi-objective optimization problem since it has to consider the tradeoff among spectrum efficiency (SE), energy efficiency (EE) and fairness. The existing methods can not effectively solve this NP-hard nonconvex problem, especially in the presence of limited channel state information (CSI). In this paper, we investigate a novel model-driven deep reinforcement learning assisted resource allocation method. We first design a novel deep neural network (DNN)-based optimization framework consisting of a series of Alternating Direction Method of Multipliers (ADMM) iterative procedures, which makes the CSI as the learned weights. Then a novel channel information absent Q-learning resource allocation (CIAQ) algorithm is proposed to train the DNN-based optimization framework without massive labeling data, where the SE, the EE, and the fairness can be jointly optimized by adjusting discount factor. Our simulation results show that, the proposed CIAQ with rapid convergence speed not only well characterizes the extent of optimization objective with partial CSI, but also significantly outperforms the current random initialization method of neural network and the other existing resource allocation algorithms in term of the tradeoff among the SE, EE and fairness.
引用
收藏
页码:983 / 997
页数:15
相关论文
共 39 条
  • [1] [Anonymous], 2014, NEURAL NETWORK DESIG
  • [2] [Anonymous], 2016, SIGNAL PROCESSING 5G
  • [3] Approximation Algorithms for Online User Association in Multi-Tier Multi-Cell Mobile Networks
    Ao, Weng Chon
    Psounis, Konstantinos
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (04) : 2361 - 2374
  • [4] Boyd S., 2009, CONVEX OPTIMIZATION
  • [5] Chen M, 2016, IEEE I C NETW INFRAS, P155, DOI 10.1109/ICNIDC.2016.7974555
  • [6] Multiple Access Design for Ultra-Dense VLC Networks: Orthogonal vs Non-Orthogonal
    Feng, Simeng
    Zhang, Rong
    Xu, Wei
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (03) : 2218 - 2232
  • [7] Green Resource Allocation Based on Deep Reinforcement Learning in Content-Centric IoT
    He, Xiaoming
    Wang, Kun
    Huang, Huawei
    Miyazaki, Toshiaki
    Wang, Yixuan
    Guo, Song
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (03) : 781 - 796
  • [8] Wireless Backhaul: Performance Modeling and Impact on User Association for 5G
    Jaber, Mona
    Javier Lopez-Martinez, F.
    Imran, Muhammad Ali
    Sutton, Andy
    Tukmanov, Anvar
    Tafazolli, Rahim
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (05) : 3095 - 3110
  • [9] Kbah Z., 2016, P INT C COMP NETW CO, P1
  • [10] Li HY, 2019, IEEE T CYBERNETICS, V49, P4388, DOI [10.1109/TCYB.2018.2864776, 10.1109/TII.2018.2825225, 10.1109/ISBI.2018.8363757]