Multiagent Deep-Reinforcement-Learning-Based Resource Allocation for Heterogeneous QoS Guarantees for Vehicular Networks

被引:54
作者
Tian, Jie [1 ]
Liu, Qianqian [1 ]
Zhang, Haixia [2 ,3 ]
Wu, Dalei [4 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Shandong Univ, Shandong Prov Key Lab Wireless Commun Technol, Jinan 250061, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Univ Tennessee, Dept Comp Sci & Engn, Chattanooga, TN 37403 USA
来源
IEEE INTERNET OF THINGS JOURNAL | 2022年 / 9卷 / 03期
基金
中国国家自然科学基金;
关键词
Resource management; Quality of service; Optimization; Reinforcement learning; Training; Entertainment industry; Copper; Deep reinforcement learning (DRL); heterogeneous applications; multi-agent deep deterministic policy gradient (MADDPG); resource allocation; LOW-LATENCY; SCHEME;
D O I
10.1109/JIOT.2021.3089823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle-to-vehicle communications can offer direct information interaction, including security-centered information and entertainment information. However, the rapid proliferation of vehicles and the diversity of communications services demand for a more intelligent and efficient resource allocation framework to enhance network performance. In this article, a multi-agent deep reinforcement learning-based resource allocation framework is developed to jointly optimize the channel allocation and power control to satisfy the heterogeneous Quality-of-Service (QoS) requirements in heterogeneous vehicular networks. In the proposed framework, the utility maximization problem is formulated by considering two types of traffics, i.e., the strict ultrareliable and low-latency requirements for safety-centric applications and the high-capacity requirements for entertainment applications. The utility of each vehicular users is formulated as a multicriterion objective function by taking into account the heterogeneous traffic requirements. To overcome the drawbacks of the traditional totally centralized and distributed deep reinforcement learning-based resource allocation approaches, we propose a multi-agent deep deterministic policy gradient algorithm with centralized learning and decentralized execution to solve the formulated optimization problem. The normalization of the input states and reward functions is introduced to speed up the training and learning progress of the proposed algorithm. Simulation results show the superiority of the proposed algorithm in terms of the convergence and system performance through the comparison with the other methods and schemes for the delay-sensitive applications and delay-tolerant applications.
引用
收藏
页码:1683 / 1695
页数:13
相关论文
共 44 条
[1]  
Arani A. H., 2016, PROC IEEE INT C COMM, P1
[2]  
Chen MM, 2019, IEEE WCNC
[3]   Resource Allocation for Device-to-Device Communications Underlaying Heterogeneous Cellular Networks Using Coalitional Games [J].
Chen, Yali ;
Ai, Bo ;
Niu, Yong ;
Guan, Ke ;
Han, Zhu .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (06) :4163-4176
[4]   Position-Based User-Centric Radio Resource Management in 5G UDN for Ultra-Reliable and Low-Latency Vehicular Communications [J].
Ding, Liqin ;
Wang, Yang ;
Wu, Peng ;
Zhang, Jiliang .
2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
[5]   Resource Allocation for High-Reliability Low-Latency Vehicular Communications With Packet Retransmission [J].
Guo, Chongtao ;
Liang, Le ;
Li, Geoffrey Ye .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (07) :6219-6230
[6]   Resource Allocation in Vehicular Communications using Graph and Deep Reinforcement Learning [J].
Gyawali, Sohan ;
Qian, Yi ;
Hu, Rose Qingyang .
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[7]   Resource Allocation Schemes Based on Coalition Games for Vehicular Communications [J].
He, Chunlong ;
Chen, Qian ;
Pan, Cunhua ;
Li, Xingquan ;
Zheng, Fu-Chun .
IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) :2340-2343
[8]   QoE-Based Resource Allocation for Heterogeneous Multi-Radio Communication in Software-Defined Vehicle Networks [J].
Huang, Wei ;
Ding, Lianghui ;
Meng, De ;
Hwang, Jenq-Neng ;
Xu, Yiling ;
Zhang, Wenjun .
IEEE ACCESS, 2018, 6 :3387-3399
[9]  
Ioffe S., 2015, P 32 INT C MACHINE L, P448
[10]   Distributed Deep Deterministic Policy Gradient for Power Allocation Control in D2D-Based V2V Communications [J].
Khoi Khac Nguyen ;
Trung Q Duong ;
Ngo Anh Vien ;
Nhien-An Le-Khac ;
Long D Nguyen .
IEEE ACCESS, 2019, 7 :164533-164543