Deep Reinforcement Learning-Based Dynamic Spectrum Access for D2D Communication Underlay Cellular Networks

Cited by: 22

Authors:

Huang, Jingfei [1]

Yang, Yang [1]

He, Gang [1]

Xiao, Yang [1]

Liu, Jun [1]

Affiliations:

[1] Beijing University of Posts and Telecommunications, School of Artificial Intelligence, Beijing 100876, People's Republic of China

Source:

IEEE COMMUNICATIONS LETTERS | 2021 / Vol. 25 / No. 08

Funding:

National Natural Science Foundation of China; Beijing Natural Science Foundation;

Keywords:

Device-to-device communication; Interference; Throughput; Resource management; Cellular networks; Heuristic algorithms; Training; dynamic spectrum access; time slots; deep reinforcement learning; double deep Q-network; RESOURCE-ALLOCATION;

DOI:

10.1109/LCOMM.2021.3079920

Chinese Library Classification (CLC) number:

TN [Electronic technology, communication technology];

Discipline classification code:

0809 ;

Abstract:

This letter investigates a deep reinforcement learning (DRL)-based spectrum access scheme for device-to-device (D2D) communication underlay cellular networks. Specifically, cellular users (CUEs) and D2D pairs attempt to access the time slots (TSs) of a shared spectrum, and TSs are dynamically scheduled to CUEs in different frames. Based on DRL theory, D2D pairs can be seen as a centralized agent which aims to learn an optimal spectrum access strategy to maximize the sum throughput without any prior information. In particular, with different locations of CUEs, the spectrum access manners for D2D communication are changed to ensure the communication quality of CUEs at the cell edge. Then, a double deep Q-network (DDQN) based D2D spectrum access (D^4SA) algorithm is proposed, which makes D2D pairs learn to decide whether to access the spectrum in different TSs. Moreover, to ensure the fairness of resource allocation among D2D pairs, we improve the proposed algorithm and incorporate fairness into the objective function. Simulation results show that our proposed algorithm can achieve an optimal sum throughput close to the theoretical upper bound, where the performance is significantly improved compared to the scheme based on base station cooperation.
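
For readers unfamiliar with the double deep Q-network (DDQN) named in the abstract, the following is a minimal sketch of the generic Double DQN target computation. It is not the authors' algorithm: the function name, the two-action encoding (0 = stay idle, 1 = access the time slot), the reward values, and the discount factor are all assumptions chosen for illustration.

```python
def double_dqn_targets(rewards, q_online_next, q_target_next, dones, gamma=0.9):
    """Generic Double DQN targets for a batch of transitions.

    The online network picks the next action; the target network
    evaluates it. All argument names here are hypothetical.
    """
    targets = []
    for r, q_on, q_tg, done in zip(rewards, q_online_next, q_target_next, dones):
        # Action selection uses the online network's Q-values.
        best_action = max(range(len(q_on)), key=lambda a: q_on[a])
        # Action evaluation uses the target network's Q-values;
        # terminal transitions do not bootstrap.
        bootstrap = 0.0 if done else gamma * q_tg[best_action]
        targets.append(r + bootstrap)
    return targets


# Toy batch with two actions per state (assumed: 0 = idle, 1 = access the TS).
targets = double_dqn_targets(
    rewards=[1.0, 0.0],
    q_online_next=[[0.2, 0.8], [0.5, 0.1]],
    q_target_next=[[0.7, 0.6], [0.4, 0.9]],
    dones=[False, True],
    gamma=0.5,
)
```

In the first transition the online network prefers action 1, so the target uses the target network's value for action 1 (0.6) rather than its maximum (0.7). Decoupling selection from evaluation in this way is what distinguishes Double DQN from a plain DQN target.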

Pages: 2614-2618

Number of pages: 5

50 items in total

[21] Interference Management and Coverage Probability Enhancement in D2D Underlay Downlink Cellular Networks [J].

Rahim, V. C. Abdul ;

Prema, S. Chris .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (04) :5843-5855

[22] Dynamic Channel Matching based on Deep Reinforcement Learning for D2D Communications [J].

Li, Zhijie ;

Liu, Zhixin ;

Yuan, Yazhou ;

Wang, Haifeng .

2020 IEEE 18TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), VOL 1, 2020, :870-875

[23] Balancing Fairness and Energy Efficiency in SWIPT-Based D2D Networks: Deep Reinforcement Learning Based Approach [J].

Han, Eun-Jeong ;

Sengly, Muy ;

Lee, Jung-Ryun .

IEEE ACCESS, 2022, 10 :64495-64503

[24] S-MFRL: Spiking Mean Field Reinforcement Learning for Dynamic Resource Allocation of D2D Networks [J].

Ye, Pei-Gen ;

Wang, Yuan-Gen ;

Tang, Weixuan .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) :1032-1047

[25] Learning-based dynamic connectivity maintenance for UAV-assisted D2D multicast communication [J].

Wang, Jingjing ;

Sun, Yanjing ;

Wang, Bowen ;

Qian, Shenshen ;

Tian, Zhijian ;

Wang, Xiaolin .

CHINA COMMUNICATIONS, 2023, 20 (10) :305-322

[26] Simultaneous wireless information and power transfer in heterogeneous cellular networks with underlay D2D communication [J].

Sreelakshmy, K. R. ;

Jacob, Lillykutty .

WIRELESS NETWORKS, 2020, 26 (05) :3315-3330

[27] Resource Allocation Using Particle Swarm Optimization for D2D Communication Underlay of Cellular Networks [J].

Su, Lin ;

Ji, Yusheng ;

Wang, Ping ;

Liu, Fuqiang .

2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, :129-133

[28] Resource Allocation for Underlay Interfering D2D Networks With Multiantenna and Imperfect CSI [J].

Elnourani, Mohamed ;

Deshmukh, Siddharth ;

Beferull-Lozano, Baltasar .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (09) :6066-6082

[29] DYNAMIC RESOURCE ALLOCATIONS BASED ON Q-LEARNING FOR D2D COMMUNICATION IN CELLULAR NETWORKS [J].

Luo, Yong ;

Shi, Zhiping ;

Zhou, Xin ;

Liu, Qiaoyan ;

Yi, Qicong .

2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, :385-388

[30] An Approach for Improving Performance of Underlay D2D Communication [J].

Gupta, Shreya ;

Trivedi, Aditya ;

Pawar, Praveen .

2018 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT'18), 2018,
