Intelligent Spectrum Sensing and Access With Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning

被引：2

作者：

Li, Xuanheng ^{[1
]}

Zhang, Yulong ^{[1
]}

Ding, Haichuan ^{[2
]}

Fang, Yuguang ^{[3
]}

机构：

[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 16024, Peoples R China

[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China

[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2024年 / 23卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Dynamic spectrum access (DSA); partial spectrum sensing; power allocation; hierarchical deep reinforcement learning; multi-agent; UNCERTAIN SHARED SPECTRUMS; COGNITIVE RADIO NETWORKS; OPTIMIZATION; ALLOCATION; 6G;

D O I：

10.1109/TWC.2023.3305567

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.

引用

页码：3131 / 3145

页数：15

共 47 条

[1] Enhanced Dynamic Spectrum Access in Multiband Cognitive Radio Networks via Optimized Resource Allocation [J].

Bhardwaj, Piyush ;

Panwar, Ankita ;

Ozdemir, Onur ;

Masazade, Engin ;

Kasperovich, Irina ;

Drozd, Andrew L. ;

Mohan, Chilukuri K. ;

Varshney, Pramod K. .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (12) :8093-8106

[2] Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks [J].

Bokobza, Yoel ;

Dabora, Ron ;

Cohen, Kobi .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (07) :4930-4946

[3] Federated Multi-Agent Deep Reinforcement Learning (Fed-MADRL) for Dynamic Spectrum Access [J].

Chang, Hao-Hsuan ;

Song, Yifei ;

Doan, Thinh T. ;

Liu, Lingjia .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) :5337-5348

[4] Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond [J].

Chang, Hao-Hsuan ;

Liu, Lingjia ;

Yi, Yang .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) :929-939

[5] Distributive Dynamic Spectrum Access Through Deep Reinforcement Learning: A Reservoir Computing-Based Approach [J].

Chang, Hao-Hsuan ;

Song, Hao ;

Yi, Yang ;

Zhang, Jianzhong ;

He, Haibo ;

Liu, Lingjia .

IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :1938-1948

[6] Joint design and separation principle for opportunistic spectrum access in the presence of sensing errors [J].

Chen, Yunxia ;

Zhao, Qing ;

Swami, Ananthram .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (05) :2053-2071

[7] Deep-Dual-Learning-Based Cotask Processing in Multiaccess Edge Computing Systems [J].

Chiang, Yi-Han ;

Chiang, Tsung-Wei ;

Zhang, Tianyu ;

Ji, Yusheng .

IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) :9383-9398

[8] Energy-Efficient Channel Switching in Cognitive Radio Networks: A Reinforcement Learning Approach [J].

Ding, Haichuan ;

Li, Xuanheng ;

Ma, Ying ;

Fang, Yuguang .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) :12359-12362

[9] Cognitive Capacity Harvesting Networks: Architectural Evolution Toward Future Cognitive Radio Networks [J].

Ding, Haichuan ;

Fang, Yuguang ;

Huang, Xiaoxia ;

Pan, Miao ;

Li, Pan ;

Glisic, Savo .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2017, 19 (03) :1902-1923

[10] A Cooperative Spectrum Sensing With Multi-Agent Reinforcement Learning Approach in Cognitive Radio Networks [J].

Gao, Ang ;

Du, Chengyuan ;

Ng, Soon Xin ;

Liang, Wei .

IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) :2604-2608

← 1 2 3 4 5 →