Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor-critic reinforcement learning approach

被引:3
作者
Giang, Hoang Thi Huong [1 ]
Hoan, Tran Nhut Khai [2 ]
Koo, Insoo [1 ]
机构
[1] Univ Ulsan UOU, Sch Elect Engn, Ulsan, South Korea
[2] Can Tho Univ, Can Tho, Vietnam
关键词
Cognitive radio network; NOMA; Energy harvesting; Actor– critic; NONORTHOGONAL MULTIPLE-ACCESS; POWER-CONTROL; 5G; OPPORTUNITIES; CHALLENGES; FUSION;
D O I
10.1007/s11276-020-02520-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Non-orthogonal multiple access (NOMA) is one of the promising techniques for spectrum efficiency in wireless networks. In this paper, we consider an uplink NOMA cognitive system, where the secondary users (SUs) can jointly transmit data to the cognitive base station (CBS) over the same spectrum resources. Thereafter, successive interference cancellation is applied at the CBS to retrieve signals transmitted by the SUs. In addition, the energy-constrained problem in wireless networks is taken into account. Therefore, we assume that the SUs are powered by a wireless energy harvester to prolong their operations; meanwhile, the CBS is equipped with a traditional electrical supply. Herein, we propose an actor-critic reinforcement learning approach to maximize the long-term throughput of the cognitive network. In particular, by interacting and learning directly from the environment over several time slots, the CBS can optimally assign the amount of transmission energy for each SU according to the remaining energy of the SUs and the availability of the primary channel. As a consequence, the simulation results verify that the proposed scheme outperforms other conventional approaches (such as Myopic NOMA and OMA), so the system reward is always maximized in the current time slot, in terms of overall throughput and energy efficiency.
引用
收藏
页码:1319 / 1334
页数:16
相关论文
共 64 条
[31]   Cognitive radio: Making software radios more personal [J].
Mitola, J ;
Maguire, GQ .
IEEE PERSONAL COMMUNICATIONS, 1999, 6 (04) :13-18
[32]  
Nikopour H, 2014, IEEE GLOB COMM CONF, P3940, DOI 10.1109/GLOCOM.2014.7037423
[33]   Joint Resource Allocation and Transmission Mode Selection Using a POMDP-Based Hybrid Half-Duplex/Full-Duplex Scheme for Secrecy Rate Maximization in Multi-Channel Cognitive Radio Networks [J].
Pham Duy Thanh ;
Tran Nhut Khai Hoan ;
Koo, Insoo .
IEEE SENSORS JOURNAL, 2020, 20 (07) :3930-3945
[34]   Efficient attack strategy for legitimate energy-powered eavesdropping in tactical cognitive radio networks [J].
Pham Duy Thanh ;
Tran Nhut Khai Hoan ;
Hiep Vu-Van ;
Koo, Insoo .
WIRELESS NETWORKS, 2019, 25 (06) :3605-3622
[35]   Optimal Linear Fusion for Distributed Detection Via Semidefinite Programming [J].
Quan, Zhi ;
Ma, Wing-Kin ;
Cui, Shuguang ;
Sayed, Ali H. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (04) :2431-2436
[36]   Information Theoretic Analysis of LDS Scheme [J].
Razavi, R. ;
Hoshyar, R. ;
Imran, M. A. ;
Wang, Y. .
IEEE COMMUNICATIONS LETTERS, 2011, 15 (08) :798-800
[37]  
Ribeiro FC, 2012, INT CONF ACOUST SPEE, P3557, DOI 10.1109/ICASSP.2012.6288685
[38]  
Saito Y, 2013, IEEE VTS VEH TECHNOL
[39]   Actor-Critic-Algorithm-Based Accurate Spectrum Sensing and Transmission Framework and Energy Conservation in Energy-Constrained Wireless Sensor Network-Based Cognitive Radios [J].
Shah, Hurmat Ali ;
Koo, Insoo ;
Kwak, Kyung Sup .
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2019, 2019
[40]   Spectrum agile radios: Utilization and sensing architectures [J].
Shankar, S ;
Cordeiro, C ;
Challapali, K .
2005 1st IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks, Conference Record, 2005, :160-169