A Model-Driven Deep Reinforcement Learning Heuristic Algorithm for Resource Allocation in Ultra-Dense Cellular Networks

被引：48

作者：

Liao, Xiaomin ^{[1
,2
]}

Shi, Jia ^{[1
]}

Li, Zan ^{[1
]}

Zhang, Lei ^{[3
]}

Xia, Baiqiang ^{[4
]}

机构：

[1] Xidian Univ, State Key Lab Integrated Serv Networks, Sch Telecommun Engn, Xian 710071, Peoples R China

[2] Natl Univ Def Technol, Sch Informat & Commun, Xian 710106, Peoples R China

[3] Univ Glasgow, Sch Engn, Glasgow G12 8QQ, Lanark, Scotland

[4] Silo AI Oy, Helsinki 00180, Finland

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2020年 / 69卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Ultra-dense cellular networks; resource allocation; deep reinforcement learning; model-driven; optimization; ENERGY EFFICIENCY; USER ASSOCIATION; SCHEME; OPTIMIZATION; MANAGEMENT; FAIRNESS;

D O I：

10.1109/TVT.2019.2954538

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Resource allocation in ultra dense network (UDN) is an multi-objective optimization problem since it has to consider the tradeoff among spectrum efficiency (SE), energy efficiency (EE) and fairness. The existing methods can not effectively solve this NP-hard nonconvex problem, especially in the presence of limited channel state information (CSI). In this paper, we investigate a novel model-driven deep reinforcement learning assisted resource allocation method. We first design a novel deep neural network (DNN)-based optimization framework consisting of a series of Alternating Direction Method of Multipliers (ADMM) iterative procedures, which makes the CSI as the learned weights. Then a novel channel information absent Q-learning resource allocation (CIAQ) algorithm is proposed to train the DNN-based optimization framework without massive labeling data, where the SE, the EE, and the fairness can be jointly optimized by adjusting discount factor. Our simulation results show that, the proposed CIAQ with rapid convergence speed not only well characterizes the extent of optimization objective with partial CSI, but also significantly outperforms the current random initialization method of neural network and the other existing resource allocation algorithms in term of the tradeoff among the SE, EE and fairness.

引用

页码：983 / 997

页数：15

共 39 条

[1] [Anonymous], 2014, NEURAL NETWORK DESIG
[2] [Anonymous], 2016, SIGNAL PROCESSING 5G
[3] Approximation Algorithms for Online User Association in Multi-Tier Multi-Cell Mobile Networks
Ao, Weng Chon
Psounis, Konstantinos
[J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (04) : 2361 - 2374
[4] Boyd S., 2009, CONVEX OPTIMIZATION
[5] Chen M, 2016, IEEE I C NETW INFRAS, P155, DOI 10.1109/ICNIDC.2016.7974555
[6] Multiple Access Design for Ultra-Dense VLC Networks: Orthogonal vs Non-Orthogonal
Feng, Simeng
Zhang, Rong
Xu, Wei
Hanzo, Lajos
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (03) : 2218 - 2232
[7] Green Resource Allocation Based on Deep Reinforcement Learning in Content-Centric IoT
He, Xiaoming
Wang, Kun
Huang, Huawei
Miyazaki, Toshiaki
Wang, Yixuan
Guo, Song
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (03) : 781 - 796
[8] Wireless Backhaul: Performance Modeling and Impact on User Association for 5G
Jaber, Mona
Javier Lopez-Martinez, F.
Imran, Muhammad Ali
Sutton, Andy
Tukmanov, Anvar
Tafazolli, Rahim
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (05) : 3095 - 3110
[9] Kbah Z., 2016, P INT C COMP NETW CO, P1
[10] Li HY, 2019, IEEE T CYBERNETICS, V49, P4388, DOI [10.1109/TCYB.2018.2864776, 10.1109/TII.2018.2825225, 10.1109/ISBI.2018.8363757]

← 1 2 3 4 →