Two-stage reinforcement-learning-based cognitive radio with exploration control

被引：21

作者：

Jiang, T. ^{[1
]}

Grace, D. ^{[1
]}

Liu, Y. ^{[1
]}

机构：

[1] Univ York, Dept Elect, Commun Res Grp, York YO10 5DD, N Yorkshire, England

来源：

IET COMMUNICATIONS | 2011年 / 5卷 / 05期

关键词：

D O I：

10.1049/iet-com.2009.0803

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This study presents a novel two-stage reinforcement-learning-based algorithm for distributed cognitive radio (CR) spectrum sharing. The traditional reinforcement-learning model is modified in order to be applied in a fully distributed CR scenario. CRs are able to discover the best available resources autonomously by utilising learning, which results in significantly improved performance, while reducing the need for spectrum sensing. Instead of sensing all available spectrum arbitrarily, the scheme is designed to share the spectrum based on an optimal spectrum sharing strategy, which is discovered by the CR agents from their trial-and-error interactions with the wireless communication environment. On the other hand, the inherent exploration against exploitation trade-off seen in reinforcement learning is also examined in the context of CR. A 'warm-up' stage is proposed to effectively control the exploration phase of the learning process. A better system performance can be expected by carefully balancing the tradeoff between exploration and exploitation. The benefit of applying a warm-up stage is demonstrated. Comparisons of system performance using different warm-up strategies are also given to illustrate their impact on the spectrum sharing process.

引用

页码：644 / 651

页数：8

共 50 条

[1] CLUSTERING AND REINFORCEMENT-LEARNING-BASED ROUTING FOR COGNITIVE RADIO NETWORKS
Saleem, Yasir
Yau, Kok-Lim Alvin
Mohamad, Hafizal
Ramli, Nordin
Rehmani, Mubashir Husain
Ni, Qiang
IEEE WIRELESS COMMUNICATIONS, 2017, 24 (04) : 146 - 151
[2] Spectrum Access In Cognitive Radio Using a Two-Stage Reinforcement Learning Approach
Raj, Vishnu
Dias, Irene
Tholeti, Thulasi
Kalyani, Sheetal
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (01) : 20 - 34
[3] Two-Stage Evolutionary Reinforcement Learning for Enhancing Exploration and Exploitation
Zhu, Qingling
Wu, Xiaoqiang
Lin, Qiuzhen
Chen, Wei-Neng
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20892 - 20900
[4] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
Yinglei Teng
F. Richard Yu
Ke Han
Yifei Wei
Yong Zhang
Wireless Personal Communications, 2013, 69 : 771 - 791
[5] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
Teng, Yinglei
Yu, F. Richard
Han, Ke
Wei, Yifei
Zhang, Yong
WIRELESS PERSONAL COMMUNICATIONS, 2013, 69 (02) : 771 - 791
[6] Reinforcement-learning-based control of convectively unstable flows
Xu, Da
Zhang, Mengqi
JOURNAL OF FLUID MECHANICS, 2023, 954
[7] Sensing, Probing, and Transmitting Policy for Energy Harvesting Cognitive Radio With Two-Stage After-State Reinforcement Learning
Wu, Keyu
Jiang, Hai
Tellambura, Chintha
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) : 1616 - 1630
[8] Efficient exploration in reinforcement learning-based cognitive radio spectrum sharing
Jiang, T.
Grace, D.
Mitchell, P. D.
IET COMMUNICATIONS, 2011, 5 (10) : 1309 - 1317
[9] A Two-Stage Spectrum Sensing Scheme Based on Cyclostationarity in Cognitive Radio
Lin, Ying-pei
He, Chen
Jiang, Ling-ge
He, Di
IEICE TRANSACTIONS ON COMMUNICATIONS, 2011, E94B (09) : 2681 - 2684
[10] Two-Stage Population Based Training Method for Deep Reinforcement Learning
Zhou, Yinda
Liu, Weiming
Li, Bin
2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 38 - 44

← 1 2 3 4 5 →