A skipping spectrum sensing scheme based on deep reinforcement learning for transform domain communication systems

被引：1

作者：

Li, Ce ^{[1
]}

Wu, Yanhua ^{[1
]}

Zhu, Rangang ^{[1
]}

Wu, Ruochen ^{[2
]}

Zhang, Zhengkun ^{[1
]}

Wang, Zunhui ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230000, Peoples R China

[2] Southeast Univ, Sch Automat, Nanjing 210000, Peoples R China

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

Transform domain communication system; Spectrum sensing; Partially observable Markov decision process; Double Deep Recurrent Q-Network; Dynamic spectrum access; COGNITIVE RADIO; CNN;

D O I：

10.1038/s41598-024-83140-w

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Spectrum sensing is a key technology and prerequisite for Transform Domain Communication Systems (TDCS). The traditional approach typically involves selecting a working sub-band and maintaining it without further changes, with spectrum sensing being conducted periodically. However, this approach presents two main issues: on the one hand, if the selected working band has few idle channels, TDCS devices are unable to flexibly switch sub-bands, leading to reduced performance; on the other hand, periodic sensing consumes time and energy, limiting TDCS's transmission efficiency. In contrast to previous studies that unrealistically modeled the problem as a Markov Decision Process (MDP), this study accounts for the fact that TDCS devices cannot fully observe the entire spectrum state and must rely on historical observations, along with the current state of sub-bands, to make informed decisions. We innovatively model this as a Partially Observable Markov Decision Process (POMDP). Moreover, we consider both the number of skipped time slots and the selection of idle sub-bands, establishing distinct termination conditions for each action. By assigning different weights to balance sensing overhead and spectrum utilization while reducing conflicts, the algorithm's adaptability and performance are improved. To address the Q-value overestimation problem inherent in traditional Deep Recurrent Q-Network (DRQN) due to the use of a single network, we propose a DDRQN-BandShift strategy that combines Double Deep Q-Network (DDQN) and DRQN. Simulation results show that the proposed scheme significantly improves TDCS transmission efficiency while effectively reducing sensing costs.

引用

页数：11

共 25 条

[1] Mitola J., Maguire G.Q., Cognitive radio: Making software radios more personal, IEEE Pers. Commun, 6, pp. 13-18, (1999)
[2] Haykin S., Cognitive radio: Brain-empowered wireless communications, IEEE J. Sel. Areas Commun, 23, pp. 201-220, (2005)
[3] Swackhammer P.J., Temple M.A., Raines R.A., Performance simulation of a transform domain communication system for multiple access applications, In MILCOM 1999. IEEE Military Communications. Conference Proceedings (Cat. No.99Ch36341), 2, pp. 1055-1059, (1999)
[4] Roberts M.L., Temple M.A., Raines R.A., Magee E.P., Initial acquisition performance of a transform domain communication system: Modeling and simulation results, In MILCOM 2000 Proceedings. 21St Century Military Communications. Architectures and Technologies for Information Superiority (Cat. No.00Ch37155), 2, pp. 1119-1123
[5] Klein Randall W., Wavelet Domain Communication System (WDCS): Design, Model, Simulation, and Analysis
[6] Tan Kefengandrian Jeancandocia Frankzhou Chi, . An Enhanced Wavelet Domain Communication System (EWDCS) with Nonstationary Interference Avoidance Capability, In IEEE Vehicular Technology Conference, pp. 1-6
[7] Su H., Et al., TDCS-IDMA system for cognitive radio networks with cloud, IEEE Access, 6, pp. 20520-20530, (2018)
[8] Domain Communication System and Its Anti-Interference Performance Analysis, In 2019 6Th International Conference on Dependable Systems and Their Applications (DSA), pp. 509-510
[9] Liang Yuanda Xinyuzhang Zheliu Huijun, . Design of doublethreshold basic function in transform domain communication system for covert communication, J. Huazhong Univ. Sci. Technol. Nat. Sci. Edi., 45, pp. 11-16, (2017)
[10] Chae K., Park J., Kim Y., Rethinking autocorrelation for deep spectrum sensing in cognitive radio networks, IEEE Internet Things J, 10, pp. 31-41, (2023)

← 1 2 3 →