Two-stage reinforcement-learning-based cognitive radio with exploration control

被引:21
|
作者
Jiang, T. [1 ]
Grace, D. [1 ]
Liu, Y. [1 ]
机构
[1] Univ York, Dept Elect, Commun Res Grp, York YO10 5DD, N Yorkshire, England
关键词
D O I
10.1049/iet-com.2009.0803
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study presents a novel two-stage reinforcement-learning-based algorithm for distributed cognitive radio (CR) spectrum sharing. The traditional reinforcement-learning model is modified in order to be applied in a fully distributed CR scenario. CRs are able to discover the best available resources autonomously by utilising learning, which results in significantly improved performance, while reducing the need for spectrum sensing. Instead of sensing all available spectrum arbitrarily, the scheme is designed to share the spectrum based on an optimal spectrum sharing strategy, which is discovered by the CR agents from their trial-and-error interactions with the wireless communication environment. On the other hand, the inherent exploration against exploitation trade-off seen in reinforcement learning is also examined in the context of CR. A 'warm-up' stage is proposed to effectively control the exploration phase of the learning process. A better system performance can be expected by carefully balancing the tradeoff between exploration and exploitation. The benefit of applying a warm-up stage is demonstrated. Comparisons of system performance using different warm-up strategies are also given to illustrate their impact on the spectrum sharing process.
引用
收藏
页码:644 / 651
页数:8
相关论文
共 50 条
  • [1] CLUSTERING AND REINFORCEMENT-LEARNING-BASED ROUTING FOR COGNITIVE RADIO NETWORKS
    Saleem, Yasir
    Yau, Kok-Lim Alvin
    Mohamad, Hafizal
    Ramli, Nordin
    Rehmani, Mubashir Husain
    Ni, Qiang
    IEEE WIRELESS COMMUNICATIONS, 2017, 24 (04) : 146 - 151
  • [2] Spectrum Access In Cognitive Radio Using a Two-Stage Reinforcement Learning Approach
    Raj, Vishnu
    Dias, Irene
    Tholeti, Thulasi
    Kalyani, Sheetal
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (01) : 20 - 34
  • [3] Two-Stage Evolutionary Reinforcement Learning for Enhancing Exploration and Exploitation
    Zhu, Qingling
    Wu, Xiaoqiang
    Lin, Qiuzhen
    Chen, Wei-Neng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20892 - 20900
  • [4] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
    Yinglei Teng
    F. Richard Yu
    Ke Han
    Yifei Wei
    Yong Zhang
    Wireless Personal Communications, 2013, 69 : 771 - 791
  • [5] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
    Teng, Yinglei
    Yu, F. Richard
    Han, Ke
    Wei, Yifei
    Zhang, Yong
    WIRELESS PERSONAL COMMUNICATIONS, 2013, 69 (02) : 771 - 791
  • [6] Reinforcement-learning-based control of convectively unstable flows
    Xu, Da
    Zhang, Mengqi
    JOURNAL OF FLUID MECHANICS, 2023, 954
  • [7] Sensing, Probing, and Transmitting Policy for Energy Harvesting Cognitive Radio With Two-Stage After-State Reinforcement Learning
    Wu, Keyu
    Jiang, Hai
    Tellambura, Chintha
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) : 1616 - 1630
  • [8] Efficient exploration in reinforcement learning-based cognitive radio spectrum sharing
    Jiang, T.
    Grace, D.
    Mitchell, P. D.
    IET COMMUNICATIONS, 2011, 5 (10) : 1309 - 1317
  • [9] A Two-Stage Spectrum Sensing Scheme Based on Cyclostationarity in Cognitive Radio
    Lin, Ying-pei
    He, Chen
    Jiang, Ling-ge
    He, Di
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2011, E94B (09) : 2681 - 2684
  • [10] Two-Stage Population Based Training Method for Deep Reinforcement Learning
    Zhou, Yinda
    Liu, Weiming
    Li, Bin
    2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 38 - 44