Uplink NOMA-based long-term throughput maximization scheme for cognitive radio networks: an actor-critic reinforcement learning approach

被引:3
作者
Giang, Hoang Thi Huong [1 ]
Hoan, Tran Nhut Khai [2 ]
Koo, Insoo [1 ]
机构
[1] Univ Ulsan UOU, Sch Elect Engn, Ulsan, South Korea
[2] Can Tho Univ, Can Tho, Vietnam
关键词
Cognitive radio network; NOMA; Energy harvesting; Actor– critic; NONORTHOGONAL MULTIPLE-ACCESS; POWER-CONTROL; 5G; OPPORTUNITIES; CHALLENGES; FUSION;
D O I
10.1007/s11276-020-02520-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Non-orthogonal multiple access (NOMA) is one of the promising techniques for spectrum efficiency in wireless networks. In this paper, we consider an uplink NOMA cognitive system, where the secondary users (SUs) can jointly transmit data to the cognitive base station (CBS) over the same spectrum resources. Thereafter, successive interference cancellation is applied at the CBS to retrieve signals transmitted by the SUs. In addition, the energy-constrained problem in wireless networks is taken into account. Therefore, we assume that the SUs are powered by a wireless energy harvester to prolong their operations; meanwhile, the CBS is equipped with a traditional electrical supply. Herein, we propose an actor-critic reinforcement learning approach to maximize the long-term throughput of the cognitive network. In particular, by interacting and learning directly from the environment over several time slots, the CBS can optimally assign the amount of transmission energy for each SU according to the remaining energy of the SUs and the availability of the primary channel. As a consequence, the simulation results verify that the proposed scheme outperforms other conventional approaches (such as Myopic NOMA and OMA), so the system reward is always maximized in the current time slot, in terms of overall throughput and energy efficiency.
引用
收藏
页码:1319 / 1334
页数:16
相关论文
共 64 条
[1]   A survey on spectrum management in cognitive radio networks [J].
Akyildiz, Ian F. ;
Lee, Won-Yeol ;
Vuran, Mehmet C. ;
Mohanty, Shantidev .
IEEE COMMUNICATIONS MAGAZINE, 2008, 46 (04) :40-48
[2]  
Al-Imari Mohammed, 2012, 2012 International Conference on Future Communication Networks (ICFCN), P52, DOI 10.1109/ICFCN.2012.6206872
[3]  
Alechina N, 2010, SPECIFICATION AND VERIFICATION OF MULTI-AGENT SYSTEMS, P1, DOI 10.1007/978-1-4419-6984-2_1
[4]   Dynamic User Clustering and Power Allocation for Uplink and Downlink Non-Orthogonal Multiple Access (NOMA) Systems [J].
Ali, Md Shipon ;
Tabassum, Hina ;
Hossain, Ekram .
IEEE ACCESS, 2016, 4 :6325-6343
[5]  
[Anonymous], 2016, ZTE Communications
[6]  
[Anonymous], 2006, P IEEE GLOBECOM
[7]   Hybrid Energy Harvesting-Based Cooperative Spectrum Sensing and Access in Heterogeneous Cognitive Radio Networks [J].
Celik, Abdulkadir ;
Alsharoa, Ahmad ;
Kamal, Ahmed E. .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2017, 3 (01) :37-48
[8]   A Single-Chip Solar Energy Harvesting IC Using Integrated Photodiodes for Biomedical Implant Applications [J].
Chen, Zhiyuan ;
Law, Man-Kay ;
Mak, Pui-In ;
Martins, Rui P. .
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2017, 11 (01) :44-53
[9]   Energy-Efficient Cooperative Spectrum Sensing: A Survey [J].
Cichon, Krzysztof ;
Kliks, Adrian ;
Bogucka, Hanna .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03) :1861-1886
[10]  
Crites R. H., 1995, Advances in Neural Information Processing Systems 7, P401