Non-cooperative reinforcement learning based routing in cognitive radio networks

被引:14
作者
Pourpeighambar, Babak [1 ]
Dehghan, Mehdi [1 ]
Sabaei, Masoud [1 ]
机构
[1] Amirkabir Univ Technol, Comp Engn Dept, Tehran, Iran
关键词
Cognitive radio networks; Non-cooperative routing; End-to-end delay minimization; Interference probability; Reinforcement learning; RESOURCE-ALLOCATION; WIRELESS NETWORKS; CHANNEL; SELECTION; PROTOCOL; SCHEME;
D O I
10.1016/j.comcom.2017.02.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cognitive Radio Networks (CRNs) have emerged to overcome the problem of spectrum scarcity caused by spreading of wireless applications. In CRNs, Secondary Users (SUs) are permitted to access opportunistically the authorized frequency bands owned by Primary Users (PUs). In this paper, we address the problem of routing several flows generated by SUs to a given destination considering the presence of PUs traffic modeled by a more realistic model based on Markov Modulated Poisson Process (MMPP). Each source SU wants to selfishly minimize the end-to-end delay of its flow meanwhile the Quality of Service (QoS) requirements of the PUs would be met. To consider quick adaptation of SUs routing decision to environment changes and non-cooperative interaction of them, we formulate the routing problem as stochastic learning processes featured by non-cooperative games. Then, we propose a distributed reinforcement learning-based scheme for solving the routing problem that can avoid information exchange between the competing SUs. The proposed scheme provably converges and simulation results demonstrate effectiveness of the proposed scheme in decreasing the delay while meeting the QoS requirements of the PUs. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:11 / 23
页数:13
相关论文
共 49 条
  • [1] Abbagnale A., 2010, INT C COMPUTER COMMU, P1
  • [2] Akimaru H, 1999, TELETRAFFIC THEORY A
  • [3] NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey
    Akyildiz, Ian F.
    Lee, Won-Yeol
    Vuran, Mehmet C.
    Mohanty, Shantidev
    [J]. COMPUTER NETWORKS, 2006, 50 (13) : 2127 - 2159
  • [4] [Anonymous], TECHNICAL REPORT
  • [5] [Anonymous], J EC BEHAV ORG
  • [6] [Anonymous], TECHNICAL REPORT
  • [7] [Anonymous], 2009, 2009 4 INT C COGN RA, DOI DOI 10.1109/CROWNCOM.2009.5189189
  • [8] [Anonymous], 2015, 2015 IEEE GLOB COMM
  • [9] [Anonymous], 2010, EURASIP J WIRELESS C
  • [10] Multi-Agent Reinforcement Learning Based Opportunistic Routing and Channel Assignment for Mobile Cognitive Radio Ad Hoc Network
    Barve, Sunita S.
    Kulkarni, Parag
    [J]. MOBILE NETWORKS & APPLICATIONS, 2014, 19 (06) : 720 - 730