A Distributed Stable Strategy Learning Algorithm for Multi-User Dynamic Spectrum Access

Cited by: 0
Authors
Gafni, Tomer [1]
Cohen, Kobi [2, 3]
Affiliations
[1] Ben Gurion Univ Negev, Sch Elect & Comp Engn, IL-8410501 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Sch Elect & Comp Engn, Cyber Secur Res Ctr, Beer Sheva, Israel
[3] Ben Gurion Univ Negev, Data Sci Res Ctr, Beer Sheva, Israel
DOI
10.1109/allerton.2019.8919920
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider the problem of multi-user dynamic spectrum access (DSA) in cognitive radio networks. The shared bandwidth is divided into K orthogonal channels, and M (secondary) users aim to access the spectrum, where K >= M. Each user is allowed to choose a single channel for transmission at each time slot. The state of each channel is modeled by a restless unknown Markovian process. In contrast to existing studies that analyzed a special case of this setting, in which each channel yields the same expected rate for all users, in this paper we consider the more general model, where each channel yields a different expected rate for each user. This general model adds the significant challenge of efficiently learning a channel allocation in a distributed manner so as to achieve a global system-wide objective. We adopt the stable matching utility as the system objective, which is known to yield strong performance in multichannel wireless networks, and develop a novel Distributed Stable Strategy Learning (DSSL) algorithm to achieve the objective. We prove theoretically that the DSSL algorithm converges to the stable matching allocation, and that the regret, defined as the loss in total rate with respect to the stable matching solution, grows logarithmically with time. Finally, we present numerical examples that support the theoretical results and demonstrate strong performance of the DSSL algorithm.
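To make the stable matching objective in the abstract concrete, the following is a minimal centralized sketch. It is not the DSSL algorithm, which is distributed and must learn the rates online. It assumes the expected rates are already known and that both sides rank by the same rate matrix: a user prefers the channel with the higher rate, and a channel prefers the user who achieves the higher rate on it. The names `rates`, `greedy_stable_matching`, and `is_stable` are hypothetical and chosen for illustration only.

```python
from itertools import product


def greedy_stable_matching(rates):
    """Return {user: channel} for a rate matrix rates[m][k] (M users, K >= M channels).

    Assumption (not from the abstract): users and channels both rank by the
    same rate matrix, so repeatedly fixing the largest remaining entry yields
    a stable allocation when all rates are distinct.
    """
    free_users = set(range(len(rates)))
    free_channels = set(range(len(rates[0])))
    allocation = {}
    while free_users:
        # Pick the unmatched (user, channel) pair with the highest expected rate.
        m, k = max(product(free_users, free_channels),
                   key=lambda pair: rates[pair[0]][pair[1]])
        allocation[m] = k
        free_users.remove(m)
        free_channels.remove(k)
    return allocation


def is_stable(rates, allocation):
    """Check that no user-channel pair would both gain by deviating."""
    owner = {k: m for m, k in allocation.items()}
    for m, k in product(range(len(rates)), range(len(rates[0]))):
        if allocation[m] == k:
            continue
        user_prefers = rates[m][k] > rates[m][allocation[m]]
        # An idle channel accepts anyone; an occupied one needs a higher rate.
        channel_prefers = k not in owner or rates[m][k] > rates[owner[k]][k]
        if user_prefers and channel_prefers:
            return False  # (m, k) is a blocking pair
    return True


# Hypothetical expected rates: 2 users (rows), 3 channels (columns).
rates = [
    [0.9, 0.5, 0.3],
    [0.8, 0.7, 0.2],
]
allocation = greedy_stable_matching(rates)
total_rate = sum(rates[m][k] for m, k in allocation.items())
```

In this toy instance user 0 takes channel 0 and user 1 takes channel 1. User 1 would prefer channel 0, but channel 0 yields a higher rate with user 0, so no blocking pair exists. Regret in the abstract's sense would be measured against `total_rate` of this allocation.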
Pages: 347-351
Page count: 5
Related papers
50 in total
  • [1] Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access
    Naparstek, Oshri
    Cohen, Kobi
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (01) : 310 - 323
  • [2] Multi-user Dynamic Spectrum Access Based on Reinforcement Learning
    Xu, Jinming
    Dou, Zheng
    Qi, Lin
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [3] Distributed learning algorithm with synchronized epochs for dynamic spectrum access in unknown environment using multi-user restless multi-armed bandit
    Agrawal, Himanshu
    Asawa, Krishna
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5435 - 5447
  • [4] Dynamic security for multi-user access control in distributed environment
    Prakash, S. Jaya
    Kumar, K. Varada Raj
    Nedunuri, Deepak
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [5] Dynamic Spectrum Access Using Stochastic Multi-User Bandits
    Bande, Meghana
    Magesh, Akshayaa
    Veeravalli, Venugopal V.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (05) : 953 - 956
  • [6] Deep Multi-User Reinforcement Learning for Dynamic Spectrum Access in Multichannel Wireless Networks
    Naparstek, Oshri
    Cohen, Kobi
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017
  • [7] Delay Analysis of Multi-User Dynamic Spectrum Access Networks
    Safavi, Ebrahim
    Subbalakshmi, K. P.
    2015 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2015 : 319 - 325
  • [8] Low-Complexity Learning for Dynamic Spectrum Access in Multi-User Multi-Channel Networks
    Kang, Sunjung
    Joo, Changhee
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018 : 1367 - 1375
  • [9] Low-Complexity Learning for Dynamic Spectrum Access in Multi-User Multi-Channel Networks
    Kang, Sunjung
    Joo, Changhee
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (11) : 3267 - 3281
  • [10] Online Learning in Decentralized Multi-user Spectrum Access with Synchronized Explorations
    Tekin, Cem
    Liu, Mingyan
    2012 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2012), 2012