Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

被引:326
作者
Zhao, Nan [1 ,2 ]
Liang, Ying-Chang [2 ]
Niyato, Dusit [3 ]
Pei, Yiyang [4 ]
Wu, Minghu [5 ]
Jiang, Yunhao [5 ]
机构
[1] Hubei Univ Technol, Hubei Collaborat Innovat Ctr High Efficiency Util, Wuhan 430068, Hubei, Peoples R China
[2] Univ Elect Sci & Technol China, CINC, Chengdu 611731, Sichuan, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[4] Singapore Inst Technol, Infocomm Technol Cluster, Singapore, Singapore
[5] Hubei Univ Technol, Hubei Key Lab High Efficiency Utilizat Solar Ener, Wuhan 430068, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Heterogeneous cellular networks; user association; resource allocation; multi-agent deep reinforcement learning; ACCESS; MANAGEMENT; SELECTION; HETNETS;
D O I
10.1109/TWC.2019.2933417
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Heterogeneous cellular networks can offload the mobile traffic and reduce the deployment costs, which have been considered to be a promising technique in the next-generation wireless network. Due to the non-convex and combinatorial characteristics, it is challenging to obtain an optimal strategy for the joint user association and resource allocation issue. In this paper, a reinforcement learning (RL) approach is proposed to achieve the maximum long-term overall network utility while guaranteeing the quality of service requirements of user equipments (UEs) in the downlink of heterogeneous cellular networks. A distributed optimization method based on multi-agent RL is developed. Moreover, to solve the computationally expensive problem with the large action space, multi-agent deep RL method is proposed. Specifically, the state, action and reward function are defined for UEs, and dueling double deep Q-network (D3QN) strategy is introduced to obtain the nearly optimal policy. Through message passing, the distributed UEs can obtain the global state space with a small communication overhead. With the double-Q strategy and dueling architecture, D3QN can rapidly converge to a subgame perfect Nash equilibrium. Simulation results demonstrate that D3QN achieves the better performance than other RL approaches in solving large-scale learning problems.
引用
收藏
页码:5141 / 5152
页数:12
相关论文
共 46 条
  • [11] Resource Allocation and Inter-Cell Interference Management for Dual-Access Small Cells
    Elsherif, Ahmed R.
    Chen, Wei-Peng
    Ito, Akira
    Ding, Zhi
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (06) : 1082 - 1096
  • [12] Fan Y, 2017, P 2 INT C REL SYST E, P1, DOI DOI 10.1109/ICEMS.2017.8056440
  • [13] Joint Resource Allocation and User Association for Heterogeneous Wireless Cellular Networks
    Fooladivanda, Dariush
    Rosenberg, Catherine
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (01) : 248 - 257
  • [14] Ghadimi E., 2017, 2017 IEEE INT C COMM, P1
  • [15] Backhaul-Aware User Association and Resource Allocation for Energy-Constrained HetNets
    Han, Qiaoni
    Yang, Bo
    Miao, Guowang
    Chen, Cailian
    Wang, Xiaocheng
    Guan, Xinping
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (01) : 580 - 593
  • [16] Software-Defined Networks with Mobile Edge Computing and Caching for Smart Cities: A Big Data Deep Reinforcement Learning Approach
    He, Ying
    Yu, F. Richard
    Zhao, Nan
    Leung, Victor C. M.
    Yin, Hongxi
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (12) : 31 - 37
  • [17] Intelligent Power Control for Spectrum Sharing in Cognitive Radios: A Deep Reinforcement Learning Approach
    Li, Xingjian
    Fang, Jun
    Cheng, Wen
    Duan, Huiping
    Chen, Zhi
    Li, Hongbin
    [J]. IEEE ACCESS, 2018, 6 : 25463 - 25473
  • [18] User Association for Load Balancing in Vehicular Networks: An Online Reinforcement Learning Approach
    Li, Zhong
    Wang, Cheng
    Jiang, Chang-Jun
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (08) : 2217 - 2228
  • [19] ULTRA-LOW-LATENCY UBIQUITOUS CONNECTIONS IN HETEROGENEOUS CLOUD RADIO ACCESS NETWORKS
    Lien, Shao-Yu
    Hung, Shao-Chou
    Chen, Kwang-Cheng
    Liang, Ying-Chang
    [J]. IEEE WIRELESS COMMUNICATIONS, 2015, 22 (03) : 22 - 31
  • [20] Optimizing User Association and Spectrum Allocation in HetNets: A Utility Perspective
    Lin, Yicheng
    Bao, Wei
    Yu, Wei
    Liang, Ben
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (06) : 1025 - 1039