Multi-User Opportunistic Spectrum Access for Cognitive Radio Networks Based on Multi-Head Self-Attention and Multi-Agent Deep Reinforcement Learning

被引：0

作者：

Bai, Weiwei ^{[1
]}

Zheng, Guoqiang ^{[1
]}

Xia, Weibing ^{[2
]}

Mu, Yu ^{[1
]}

Xue, Yujun ^{[3
]}

机构：

[1] Henan Univ Sci & Technol, Coll Informat Engn, Luoyang 471023, Peoples R China

[2] Luoyang Artificial Intelligence Res Inst Co Ltd, Luoyang 471000, Peoples R China

[3] Henan Univ Sci & Technol, Sch Mechatron Engn, Luoyang 471023, Peoples R China

来源：

SENSORS | 2025年 / 25卷 / 07期

基金：

中国国家自然科学基金;

关键词：

opportunistic spectrum access; throughput; multi-head self-attention; multi-agent deep reinforcement learning; multi-user spectrum access; cognitive radio network;

D O I：

10.3390/s25072025

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Aiming to address the issue of multi-user dynamic spectrum access in an opportunistic mode in cognitive radio networks leading to low sum throughput, we propose a multi-user opportunistic spectrum access method based on multi-head self-attention and multi-agent deep reinforcement learning. First, an optimization model for joint channel selection and power control in multi-user systems is constructed based on centralized training with a decentralized execution framework. In the training phase, the decision-making policy is optimized using global information, while in the execution phase, each agent makes decisions according to its observations. Meanwhile, a multi-constraint dynamic proportional reward function is designed to guide the agent in selecting more rational actions by refining the constraints and dynamically adjusting the reward proportion. Furthermore, a multi-head self-attention mechanism is incorporated into the critic network to dynamically allocate attention weights to different users, thereby enhancing the ability of the network to estimate the joint action value. Finally, the proposed method is evaluated in terms of convergence, throughput, and dynamic performance. Simulation results demonstrate that the proposed method significantly improves the sum throughput of secondary users in opportunistic spectrum access.

引用

页数：23

共 28 条

[21] An SVM-Based Feature Detection Scheme for Spatial Spectrum Sensing [J].

Tang, Lihao ;

Zhao, Lei ;

Jiang, Yuan .

IEEE COMMUNICATIONS LETTERS, 2023, 27 (08) :2132-2136

[22] Statistical Feature-Based SVM Wideband Sensing [J].

Varma, Ashwini Kumar ;

Mitra, Debjani .

IEEE COMMUNICATIONS LETTERS, 2020, 24 (03) :581-584

[23]

Vaswani A, 2017, ADV NEUR IN, V30

[24] Deep Learning-Based Spectrum Sensing in Cognitive Radio: A CNN-LSTM Approach [J].

Xie, Jiandong ;

Fang, Jun ;

Liu, Chang ;

Li, Xuanheng .

IEEE COMMUNICATIONS LETTERS, 2020, 24 (10) :2196-2200

[25] Decentralized Opportunistic Channel Access in CRNs Using Big-Data Driven Learning Algorithm [J].

Xu, Zuohong ;

Zhang, Zhou ;

Wang, Shilian ;

Jolfaei, Alireza ;

Bashir, Ali Kashif ;

Yan, Ye ;

Mumtaz, Shahid .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (01) :57-69

[26] Gaussian Process Reinforcement Learning for Fast Opportunistic Spectrum Access [J].

Yan, Zun ;

Cheng, Peng ;

Chen, Zhuo ;

Li, Yonghui ;

Vucetic, Branka .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 :2613-2628

[27] Steady-State Rate-Optimal Power Adaptation in Energy Harvesting Opportunistic Cognitive Radios With Spectrum Sensing and Channel Estimation Errors [J].

Yazdani, Hassan ;

Vosoughi, Azadeh .

IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2021, 5 (04) :2104-2120

[28] Multi-Channel Opportunistic Access for Heterogeneous Networks Based on Deep Reinforcement Learning [J].

Ye, Xiaowen ;

Yu, Yiding ;

Fu, Liqun .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (02) :794-807

← 1 2 3 →