Hybrid Centralized-Distributed Resource Allocation Based on Deep Reinforcement Learning for Cooperative D2D Communications

被引:1
作者
Yu, Yang [1 ]
Tang, Xiaoqing [2 ]
机构
[1] Hubei Three Gorges Polytech, Elect Informat Sch, Yichang 443000, Peoples R China
[2] Hubei Univ, Sch Artificial Intelligence, Wuhan 430062, Peoples R China
关键词
Device-to-device communication; Resource management; Copper; Wireless communication; Relays; Power control; Interference; Heuristic algorithms; Energy efficiency; Cellular networks; Cooperative communication; deep reinforcement learning; device-to-device communication; energy efficiency; power control; spectrum allocation; TO-DEVICE COMMUNICATIONS; POWER ALLOCATION; NETWORKS;
D O I
10.1109/ACCESS.2024.3521590
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Device-to-device (D2D) technology enables direct communication between adjacent devices within cellular networks. Due to its high data rate, low latency, and performance improvement in spectrum and energy efficiency, it has been widely investigated and applied as a critical technology in 5G New Radio (NR). Cooperative D2D communication can achieve a win-win situation between cellular users (CUs) and D2D users (DUs) through cooperative relaying techniques. In addition to conventional overlay and underlay D2D communications, it has attracted extensive attention from academic and industrial circles in the past decade. This paper delves into optimizing joint spectrum allocation, power control, and link-matching between multiple CUs and DUs for cooperative D2D communications. Weighted sum energy efficiency (WSEE) is used as the performance metric to address the challenges of green communication and sustainable development. This mixed-integer fractional programming (MIFP) problem can be decomposed into: 1. a classic weighted bipartite graph matching; 2. a series of nonconvex spectrum allocation and power control problems between potentially matched cellular and D2D link pairs. To address this issue, we propose a hybrid centralized-distributed scheme based on deep reinforcement learning (DRL) and the Kuhn-Munkres (KM) algorithm. Leveraging the former, the CUs and DUs autonomously optimize spectrum allocation and power control by only utilizing local information. Then, the base station (BS) determines the link matching utilizing the latter. Simulation results reveal that it achieves more than 96% WSEE of the optimal scheme and 98% WSEE of the centralized DRL-based scheme. It significantly enhances the network convergence speed with low centralized computational overheads. In addition, we also propose and utilize cooperative link sets for corresponding D2D links to accelerate the proposed scheme and reduce signaling exchange further: an average of about 85% WSEE of the optimal scheme is achieved, while more than 50% of signaling and distributed computing overheads are reduced.
引用
收藏
页码:196609 / 196623
页数:15
相关论文
共 53 条
[1]   Distributed Power Allocation for D2D Communications Underlaying/Overlaying OFDMA Cellular Networks [J].
Abrardo, Andrea ;
Moretti, Marco .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (03) :1466-1479
[2]  
Achiam J, 2017, Arxiv, DOI [arXiv:1705.10528, 10.48550/ARXIV.1705.10528, DOI 10.48550/ARXIV.1705.10528]
[3]  
Ali Z, 2021, IEEE ACCESS, V9, P89554, DOI [10.1109/ACCESS.2021.3090855, 10.1109/access.2021.3090855]
[4]   Spectrum Efficient Mode Selection and Resource Allocation Optimization for D2D Communication in HetNet: A Multi-Agent Q-Learning Approach [J].
Alibraheemi, Ali Majid Hasan ;
Hindia, Mhd Nour ;
Tengku Mohmed Noor Izam, Tengku Faiz ;
Dimyati, Kaharudin .
IEEE ACCESS, 2024, 12 :131217-131229
[5]   Stacked Intelligent Metasurface-Aided MIMO Transceiver Design [J].
An, Jiancheng ;
Yuen, Chau ;
Xu, Chao ;
Li, Hongbin ;
Ng, Derrick Wing Kwan ;
Di Renzo, Marco ;
Debbah, Merouane ;
Hanzo, Lajos .
IEEE WIRELESS COMMUNICATIONS, 2024, 31 (04) :123-131
[6]   Stacked Intelligent Metasurfaces for Efficient Holographic MIMO Communications in 6G [J].
An, Jiancheng ;
Xu, Chao ;
Ng, Derrick Wing Kwan ;
Alexandropoulos, George C. ;
Huang, Chongwen ;
Yuen, Chau ;
Hanzo, Lajos .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (08) :2380-2396
[7]   Achieving Sustainable Ultra-Dense Heterogeneous Networks for 5G [J].
An, Jianping ;
Yang, Kai ;
Wu, Jinsong ;
Ye, Neng ;
Guo, Song ;
Liao, Zhifang .
IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (12) :84-90
[8]  
[Anonymous], 1997, Convex Analysis. Princeton Landmarks in Mathematics
[9]   A Survey on Device-to-Device Communication in Cellular Networks [J].
Asadi, Arash ;
Wang, Qing ;
Mancuso, Vincenzo .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (04) :1801-1819
[10]   COOPERATIVE DEVICE-TO-DEVICE COMMUNICATIONS IN CELLULAR NETWORKS [J].
Cao, Yang ;
Jiang, Tao ;
Wang, Chonggang .
IEEE WIRELESS COMMUNICATIONS, 2015, 22 (03) :124-129