Deep Reinforcement Learning-Based Optimization Method for D2D Communication Energy Efficiency in Heterogeneous Cellular Networks

被引:6
作者
Pan, Ziyu [1 ]
Yang, Jie [1 ]
机构
[1] Nanjing Inst Technol, Sch Informat & Commun, Nanjing 211167, Peoples R China
基金
中国国家自然科学基金;
关键词
Device-to-device communication; Energy efficiency; Optimization; Quality of service; Cellular networks; Resource management; Base stations; Heterogeneous networks; Telecommunication traffic; Communication energy efficiency; D2D communication; DRL; heterogeneous cellular networks; RESOURCE-MANAGEMENT; MODE SELECTION; ALGORITHM; ALLOCATION;
D O I
10.1109/ACCESS.2024.3467393
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given the context of the challenge of exponentially increasing data traffic on communication networks brought by the 5G era, this paper focuses on how to apply deep reinforcement learning (DRL) techniques to solve the problem of optimizing the energy efficiency of D2D communications in heterogeneous cellular network environments. We propose a joint resource allocation scheme based on multi-intelligent deep reinforcement learning, which enables D2D devices to intelligently switch between license-free and optimal license bands and adjust the transmit power in real time to maximize the energy efficiency improvement. In this work, a multi-intelligent deep reinforcement learning framework is designed to enable D2D users in heterogeneous networks to make collaborative decisions and dynamically adjust their communication strategies according to real-time network status and environmental changes. In this paper, a deep Q-network model with a graph attention network (GAT) as the core structure is constructed; this model can cope with the complexity and diversity of network states and learn and execute optimal resource allocation strategies. In this paper, we propose a targeted loss function design that balances the optimization goal of D2D communication energy efficiency with network stability and long-term gains. Through rigorous simulation experiments, this paper verifies that a DRL-based approach can significantly improve the energy efficiency of D2D communications in heterogeneous cellular networks in real-world scenarios while ensuring the stability of the quality of service (in terms of, e.g., rate, delay, and resource utilization).
引用
收藏
页码:140439 / 140455
页数:17
相关论文
共 60 条
[21]   Robust Multi-Objective Optimization for EE-SE Tradeoff in D2D Communications Underlaying Heterogeneous Networks [J].
Hao, Yuanyuan ;
Ni, Qiang ;
Li, Hai ;
Hou, Shujuan .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2018, 66 (10) :4936-4949
[22]   Energy-efficient relay selection and power allocation for multi-source multicast network-coded D2D communications [J].
Hayati, Maryam ;
Kalbkhani, Hashem ;
Shayesteh, Mahrokh G. .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2021, 128
[23]   D2D-enabled resource management in secrecy-ensured 5G and beyond Heterogeneous networks [J].
Irrum, Fauzia ;
Ali, Mudassar ;
Naeem, Muhammad ;
Anpalagan, Alagan ;
Qaisar, Saad ;
Qamar, Farhan .
PHYSICAL COMMUNICATION, 2021, 45
[24]   Survey on the state-of-the-art in device-to-device communication: A resource allocation perspective [J].
Islam, Tariq ;
Kwon, Cheolhyeon .
AD HOC NETWORKS, 2022, 136
[25]   PCF-Based LTE Wi-Fi Aggregation for Coordinating and Offloading the Cellular Traffic to D2D Network [J].
Ismaiel, Bushra ;
Abolhasan, Mehran ;
Ni, Wei ;
Smith, David ;
Franklin, Daniel ;
Dutkiewicz, Eryk ;
Krunz, Marwan M. ;
Jamalipour, Abbas .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (12) :12193-12203
[26]   Content Caching and Channel Allocation in D2D-Assisted Wireless HetNets [J].
Jaafar, Wael ;
Mseddi, Amina ;
Ajib, Wessam ;
Elbiaze, Halima .
IEEE ACCESS, 2021, 9 :112502-112515
[27]   DRL-Based Resource Allocation for NOMA-Enabled D2D Communications Underlay Cellular Networks [J].
Jeong, Yun Jae ;
Yu, Seoyoung ;
Lee, Jeong Woo .
IEEE ACCESS, 2023, 11 :140270-140286
[28]   Energy-spectrum-efficient three-tier heterogeneous networks with D2D harvesting energy and uplink coverage analysis [J].
Ji, Shanshan ;
Jia, Xiangdong ;
Fan, Qiaoling ;
Xie, Mangang ;
Zhou, Meng .
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2018, 31 (13)
[29]   Power Optimization in Device-to-Device Communications: A Deep Reinforcement Learning Approach With Dynamic Reward [J].
Ji, Zelin ;
Kiani, Adnan K. ;
Qin, Zhijin ;
Ahmad, Rizwan .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (03) :508-511
[30]   Connectivity Mode Management for User Devices in Heterogeneous D2D Networks [J].
Kafiloglu, S. Sinem ;
Gur, Gurkan ;
Alagoz, Fatih .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (01) :194-198