RETRACTED: Energy-aware resource management for uplink non-orthogonal multiple access: Multi-agent deep reinforcement learning (Retracted Article)

被引:5
作者
Li, Yingfang [1 ]
Yang, Bo [1 ]
Yan, Li [1 ]
Gao, Wei [2 ]
机构
[1] Honghe Univ, Sch Engn, Mengzi 661199, Peoples R China
[2] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2020年 / 105卷
关键词
Non-orthogonal multiple access; Resource allocation; Energy efficiency; Deep reinforcement learning; Deep deterministic policy gradient; POWER ALLOCATION; NOMA SYSTEMS; 5G SYSTEMS; OPPORTUNITIES;
D O I
10.1016/j.future.2019.12.047
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Non-orthogonal multiple access (NOMA) is one of the promising technologies to meet the huge access demand and the high data rate requirements of the next generation networks. In this paper, we investigate the joint subchannel assignment and power allocation problem in an uplink multi-user NOMA system to maximize the energy efficiency (EE) while ensuring the quality-of-service (QoS) of all users. Different from conventional model-based resource allocation methods, we propose two deep reinforcement learning (DRL) based frameworks to solve this non-convex and dynamic optimization problem, referred to as discrete DRL based resource allocation (DDRA) framework and continuous DRL based resource allocation (CDRA) framework. Specifically, for the DDRA framework, we use a deep Q network (DQN) to output the optimum subchannel assignment policy, and design a distributed and discretized multi-DQN based network to allocate the corresponding transmit power of all users. For the CDRA framework, we design a joint DQN and deep deterministic policy gradient (DDPG) based network to generate the optimal subchannel assignment and power allocation policy. The entire resource allocation policies of these two frameworks are adjusted by updating the weights of their neural networks according to feedback of the system. Numerical results show that the proposed DRL-based resource allocation frameworks can significantly improve the EE of the whole NOMA system compared with other approaches. The proposed DRL based frameworks can provide good performance in various moving speed scenarios through adjusting learning parameters. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:684 / 694
页数:11
相关论文
共 36 条
  • [1] Downlink Power Allocation for CoMP-NOMA in Multi-Cell Networks
    Ali, Md Shipon
    Hossain, Ekram
    Al-Dweik, Arafat
    Kim, Dong In
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2018, 66 (09) : 3982 - 3998
  • [2] [Anonymous], 2014, P INT C INT C MACH L
  • [3] Bertsekas Dimitri P, 2011, Dynamic programming and optimal control, VII
  • [4] Busoniu Lucian, 2017, Reinforcement Learning and Dynamic Programming Using Function Approximators
  • [5] Learning Radio Resource Management in RANs: Framework, Opportunities, and Challenges
    Calabrese, Francesco Davide
    Wang, Li
    Ghadimi, Euhanna
    Peters, Gunnar
    Hanzo, Lajos
    Soldati, Pablo
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (09) : 138 - 145
  • [6] Reinforcement Learning-Based Multiaccess Control and Battery Prediction With Energy Harvesting in IoT Systems
    Chu, Man
    Li, Hang
    Liao, Xuewen
    Cui, Shuguang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02): : 2009 - 2020
  • [7] A Survey on Non-Orthogonal Multiple Access for 5G Networks: Research Challenges and Future Trends
    Ding, Zhiguo
    Lei, Xianfu
    Karagiannidis, George K.
    Schober, Robert
    Yuan, Jinhong
    Bhargava, Vijay K.
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (10) : 2181 - 2195
  • [8] On the Performance of Non-Orthogonal Multiple Access in 5G Systems with Randomly Deployed Users
    Ding, Zhiguo
    Yang, Zheng
    Fan, Pingzhi
    Poor, H. Vincent
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (12) : 1501 - 1505
  • [9] Learning Optimal Resource Allocations in Wireless Systems
    Eisen, Mark
    Zhang, Clark
    Chamon, Luiz F. O.
    Lee, Daniel D.
    Ribeiro, Alejandro
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (10) : 2775 - 2790
  • [10] Joint User Scheduling and Power Allocation Optimization for Energy-Efficient NOMA Systems With Imperfect CSI
    Fang, Fang
    Zhang, Haijun
    Cheng, Julian
    Roy, Sebastien
    Leung, Victor C. M.
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (12) : 2874 - 2885