Energy-Efficient Resource Allocation in Uplink NOMA Systems with Deep Reinforcement Learning

被引:43
作者
Zhang, Yuhan [1 ]
Wang, Xiaoming [1 ,2 ]
Xu, Youyun [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Natl Engn Res Ctr Commun & Networking, Nanjing 210003, Peoples R China
[2] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
来源
2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP) | 2019年
基金
中国国家自然科学基金;
关键词
Non-orthogonal multiple access (NOMA); resource allocation; deep reinforcement learning; deep deterministic policy gradient; NONORTHOGONAL MULTIPLE-ACCESS;
D O I
10.1109/wcsp.2019.8927898
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Non-orthogonal multiple access (NOMA) is regarded as a promising technology to satisfy the huge access demand and data rate requirements of the next generation network. In this paper, we investigate the joint subcarrier assignment and power allocation problem in an uplink multi-user NOMA system to maximize the energy efficiency (EE) while ensuring the quality-of-service (QoS) of all users. Different from conventional resource allocation methods, we propose a two-step deep reinforcement learning (DRL) based algorithm to solve this non-convex and dynamic optimization problem. In particular, with the current channel conditions as input, we design a deep q-network (DQN) to output the optimum subcarrier assignment policy, then use a deep deterministic policy gradient (DDPG) network to dynamically output the transmit power of all users, and finally adjust the entire resource allocation policy by updating the weights of neural networks according to the feedback of the system. Simulation results show that our DRL based algorithm can provide better EE under various transmit power limitations compared with other approaches.
引用
收藏
页数:6
相关论文
共 13 条
[1]   Reinforcement Learning-Based Multiaccess Control and Battery Prediction With Energy Harvesting in IoT Systems [J].
Chu, Man ;
Li, Hang ;
Liao, Xuewen ;
Cui, Shuguang .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :2009-2020
[2]   Deep Learning for an Effective Nonorthogonal Multiple Access Scheme [J].
Gui, Guan ;
Huang, Hongji ;
Song, Yiwei ;
Sari, Hikmet .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (09) :8440-8450
[3]   Power-Domain Non-Orthogonal Multiple Access (NOMA) in 5G Systems: Potentials and Challenges [J].
Islam, S. M. Riazul ;
Avazov, Nurilla ;
Dobre, Octavia A. ;
Kwak, Kyung-Sup .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2017, 19 (02) :721-742
[4]  
Lillicrap TP, 2015, ARXIV150902971
[5]   Deep Cognitive Perspective: Resource Allocation for NOMA-Based Heterogeneous IoT With Imperfect SIC [J].
Liu, Miao ;
Song, Tiecheng ;
Gui, Guan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :2885-2894
[6]   Nonorthogonal Multiple Access for 5G and Beyond [J].
Liu, Yuanwei ;
Qin, Zhijin ;
Elkashlan, Maged ;
Ding, Zhiguo ;
Nallanathan, Arumugam ;
Hanzo, Lajos .
PROCEEDINGS OF THE IEEE, 2017, 105 (12) :2347-2381
[7]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[8]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[9]   Sentiment Analysis of Micro-blog Integrated on Explicit Semantic Analysis Method [J].
Wang, Chun Hua ;
Han, Dong .
WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (02) :1095-1105
[10]   Reinforcement Learning-Based NOMA Power Allocation in the Presence of Smart Jamming [J].
Xiao, Liang ;
Li, Yanda ;
Dai, Canhuang ;
Dai, Huaiyu ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (04) :3377-3389