Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

被引：326

作者：

Zhao, Nan ^{[1
,2
]}

Liang, Ying-Chang ^{[2
]}

Niyato, Dusit ^{[3
]}

Pei, Yiyang ^{[4
]}

Wu, Minghu ^{[5
]}

Jiang, Yunhao ^{[5
]}

机构：

[1] Hubei Univ Technol, Hubei Collaborat Innovat Ctr High Efficiency Util, Wuhan 430068, Hubei, Peoples R China

[2] Univ Elect Sci & Technol China, CINC, Chengdu 611731, Sichuan, Peoples R China

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[4] Singapore Inst Technol, Infocomm Technol Cluster, Singapore, Singapore

[5] Hubei Univ Technol, Hubei Key Lab High Efficiency Utilizat Solar Ener, Wuhan 430068, Hubei, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2019年 / 18卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Heterogeneous cellular networks; user association; resource allocation; multi-agent deep reinforcement learning; ACCESS; MANAGEMENT; SELECTION; HETNETS;

D O I：

10.1109/TWC.2019.2933417

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Heterogeneous cellular networks can offload the mobile traffic and reduce the deployment costs, which have been considered to be a promising technique in the next-generation wireless network. Due to the non-convex and combinatorial characteristics, it is challenging to obtain an optimal strategy for the joint user association and resource allocation issue. In this paper, a reinforcement learning (RL) approach is proposed to achieve the maximum long-term overall network utility while guaranteeing the quality of service requirements of user equipments (UEs) in the downlink of heterogeneous cellular networks. A distributed optimization method based on multi-agent RL is developed. Moreover, to solve the computationally expensive problem with the large action space, multi-agent deep RL method is proposed. Specifically, the state, action and reward function are defined for UEs, and dueling double deep Q-network (D3QN) strategy is introduced to obtain the nearly optimal policy. Through message passing, the distributed UEs can obtain the global state space with a small communication overhead. With the double-Q strategy and dueling architecture, D3QN can rapidly converge to a subgame perfect Nash equilibrium. Simulation results demonstrate that D3QN achieves the better performance than other RL approaches in solving large-scale learning problems.

引用

页码：5141 / 5152

页数：12

共 46 条

[11] Resource Allocation and Inter-Cell Interference Management for Dual-Access Small Cells
Elsherif, Ahmed R.
Chen, Wei-Peng
Ito, Akira
Ding, Zhi
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (06) : 1082 - 1096
[12] Fan Y, 2017, P 2 INT C REL SYST E, P1, DOI DOI 10.1109/ICEMS.2017.8056440
[13] Joint Resource Allocation and User Association for Heterogeneous Wireless Cellular Networks
Fooladivanda, Dariush
Rosenberg, Catherine
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (01) : 248 - 257
[14] Ghadimi E., 2017, 2017 IEEE INT C COMM, P1
[15] Backhaul-Aware User Association and Resource Allocation for Energy-Constrained HetNets
Han, Qiaoni
Yang, Bo
Miao, Guowang
Chen, Cailian
Wang, Xiaocheng
Guan, Xinping
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (01) : 580 - 593
[16] Software-Defined Networks with Mobile Edge Computing and Caching for Smart Cities: A Big Data Deep Reinforcement Learning Approach
He, Ying
Yu, F. Richard
Zhao, Nan
Leung, Victor C. M.
Yin, Hongxi
[J]. IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (12) : 31 - 37
[17] Intelligent Power Control for Spectrum Sharing in Cognitive Radios: A Deep Reinforcement Learning Approach
Li, Xingjian
Fang, Jun
Cheng, Wen
Duan, Huiping
Chen, Zhi
Li, Hongbin
[J]. IEEE ACCESS, 2018, 6 : 25463 - 25473
[18] User Association for Load Balancing in Vehicular Networks: An Online Reinforcement Learning Approach
Li, Zhong
Wang, Cheng
Jiang, Chang-Jun
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (08) : 2217 - 2228
[19] ULTRA-LOW-LATENCY UBIQUITOUS CONNECTIONS IN HETEROGENEOUS CLOUD RADIO ACCESS NETWORKS
Lien, Shao-Yu
Hung, Shao-Chou
Chen, Kwang-Cheng
Liang, Ying-Chang
[J]. IEEE WIRELESS COMMUNICATIONS, 2015, 22 (03) : 22 - 31
[20] Optimizing User Association and Spectrum Allocation in HetNets: A Utility Perspective
Lin, Yicheng
Bao, Wei
Yu, Wei
Liang, Ben
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (06) : 1025 - 1039

← 1 2 3 4 5 →