A Kind of Joint Routing and Resource Allocation Scheme Based on Prioritized Memories-Deep Q Network for Cognitive Radio Ad Hoc Networks

被引:21
作者
Du, Yihang [1 ]
Zhang, Fan [2 ]
Xue, Lei [1 ]
机构
[1] Natl Univ Def Technol, Elect Countermeasure Inst, Hefei 230000, Anhui, Peoples R China
[2] Anhui Xinhua Univ, Sci & Technol Res Bur, Hefei 230000, Anhui, Peoples R China
基金
中国博士后科学基金;
关键词
cognitive radio; joint routing and resource allocation; responsibility rating; Prioritized Memories Deep Q-Network;
D O I
10.3390/s18072119
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Cognitive Radio (CR) is a promising technology to overcome spectrum scarcity, which currently faces lots of unsolved problems. One of the critical challenges for setting up such systems is how to coordinate multiple protocol layers such as routing and spectrum access in a partially observable environment. In this paper, a deep reinforcement learning approach is adopted for solving above problem. Firstly, for the purpose of compressing huge action space in the cross-layer design problem, a novel concept named responsibility rating is introduced to help decide the transmission power of every Secondary User (SU). In order to deal with problem of dimension curse while reducing replay memory, the Prioritized Memories Deep Q-Network (PM-DQN) is proposed. Furthermore, PM-DQN is applied to solve the joint routing and resource allocation problem in cognitive radio ad hoc network for minimizing the transmission delay and power consumption. Simulation results illustrates that our proposed algorithm can reduce the end-to-end delay, packet loss ratio and estimation error while achieving higher energy efficiency compared with traditional algorithm.
引用
收藏
页数:21
相关论文
共 27 条
[1]  
Andre D, 1998, ADV NEUR IN, V10, P1001
[2]  
[Anonymous], 2013, Playing atari with deep reinforcement learning
[3]  
[Anonymous], 2015, P INT C LEARN REPR I
[4]   Multi-Agent Reinforcement Learning Based Opportunistic Routing and Channel Assignment for Mobile Cognitive Radio Ad Hoc Network [J].
Barve, Sunita S. ;
Kulkarni, Parag .
MOBILE NETWORKS & APPLICATIONS, 2014, 19 (06) :720-730
[5]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[6]   A NOTE ON THE GENERATION OF RANDOM NORMAL DEVIATES [J].
BOX, GEP ;
MULLER, ME .
ANNALS OF MATHEMATICAL STATISTICS, 1958, 29 (02) :610-611
[7]   Routing in cognitive radio networks: Challenges and solutions [J].
Cesana, Matteo ;
Cuomo, Francesca ;
Ekici, Eylem .
AD HOC NETWORKS, 2011, 9 (03) :228-248
[8]   Cooperative Routing for Underlay Cognitive Radio Networks Using Mutual-Information Accumulation [J].
Chen, Hao ;
Liu, Lingjia ;
Matyjas, John D. ;
Medley, Michael J. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (12) :7110-7122
[9]   Stochastic Power Adaptation with Multiagent Reinforcement Learning for Cognitive Wireless Mesh Networks [J].
Chen, Xianfu ;
Zhao, Zhifeng ;
Zhang, Honggang .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2013, 12 (11) :2155-2166
[10]   Cross-Layer Routing and Dynamic Spectrum Allocation in Cognitive Radio Ad Hoc Networks [J].
Ding, Lei ;
Melodia, Tommaso ;
Batalama, Stella N. ;
Matyjas, John D. ;
Medley, Michael J. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2010, 59 (04) :1969-1979