Two-stage reinforcement-learning-based cognitive radio with exploration control

被引:21
作者
Jiang, T. [1 ]
Grace, D. [1 ]
Liu, Y. [1 ]
机构
[1] Univ York, Dept Elect, Commun Res Grp, York YO10 5DD, N Yorkshire, England
关键词
D O I
10.1049/iet-com.2009.0803
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study presents a novel two-stage reinforcement-learning-based algorithm for distributed cognitive radio (CR) spectrum sharing. The traditional reinforcement-learning model is modified in order to be applied in a fully distributed CR scenario. CRs are able to discover the best available resources autonomously by utilising learning, which results in significantly improved performance, while reducing the need for spectrum sensing. Instead of sensing all available spectrum arbitrarily, the scheme is designed to share the spectrum based on an optimal spectrum sharing strategy, which is discovered by the CR agents from their trial-and-error interactions with the wireless communication environment. On the other hand, the inherent exploration against exploitation trade-off seen in reinforcement learning is also examined in the context of CR. A 'warm-up' stage is proposed to effectively control the exploration phase of the learning process. A better system performance can be expected by carefully balancing the tradeoff between exploration and exploitation. The benefit of applying a warm-up stage is demonstrated. Comparisons of system performance using different warm-up strategies are also given to illustrate their impact on the spectrum sharing process.
引用
收藏
页码:644 / 651
页数:8
相关论文
共 50 条
  • [41] A Two-Stage Target Search and Tracking Method for UAV Based on Deep Reinforcement Learning
    Liu, Mei
    Wei, Jingbo
    Liu, Kun
    DRONES, 2024, 8 (10)
  • [42] Two-Stage Reinforcement Learning-Based Differential Evolution for Solving Nonlinear Equations
    Liao, Zuowen
    Gong, Wenyin
    Li, Shuijia
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4279 - 4290
  • [43] An Optimized Position Control via Reinforcement-Learning-Based Hybrid Structure Strategy
    Amare, Nebiyeleul Daniel
    Yang, Sun Jick
    Son, Young Ik
    ACTUATORS, 2025, 14 (04)
  • [44] Reinforcement-learning-based control of turbulent channel flows at high Reynolds numbers
    Zhou, Zisong
    Zhang, Mengqi
    Zhu, Xiaojue
    JOURNAL OF FLUID MECHANICS, 2025, 1006
  • [45] H∞ Control for Interconnected Systems With Unknown System Dynamics: A Two-Stage Reinforcement Learning Method
    Liu, Jinxu
    Shen, Hao
    Wang, Jing
    Cao, Jinde
    Rutkowski, Leszek
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 6388 - 6397
  • [46] A Two-Stage Multi-Objective Evolutionary Reinforcement Learning Framework for Continuous Robot Control
    Hai Long Tran
    Long Doan
    Ngoc Hoang Luong
    Huynh Thi Thanh Binh
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 577 - 585
  • [47] Reinforcement-Learning-Based Proactive Control for Enabling Power Grid Resilience to Wildfire
    Kadir, Salah Uddin
    Majumder, Subir
    Srivastava, Anurag K.
    Chhokra, Ajay Dev
    Neema, Himanshu
    Dubey, Abhishek
    Laszka, Aron
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (01) : 795 - 805
  • [48] Cooperative spectrum sensing with two-stage reporting for cognitive radio networks
    So, Jaewoo
    ELECTRONICS LETTERS, 2016, 52 (01) : 83 - 84
  • [49] Two-stage Spectrum Sensing for Cognitive Radio under Noise Uncertainty
    Srisomboon, Kanabadee
    Prayote, Akara
    Lee, Wilaiporn
    2015 EIGHTH INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND UBIQUITOUS NETWORKING (ICMU), 2015, : 19 - 24
  • [50] Two-Stage Spectrum Sensing for Cognitive Radio Using Eigenvalues Detection
    Mashta, Faten
    Altabban, Wissam
    Wainakh, Mohieddin
    INTERNATIONAL JOURNAL OF INTERDISCIPLINARY TELECOMMUNICATIONS AND NETWORKING, 2020, 12 (04) : 18 - 36