Resource Allocation in V2X Networks: From Classical Optimization to Machine Learning-Based Solutions

被引：5

作者：

Parvini, Mohammad ^{[1
]}

Schulz, Philipp ^{[1
]}

Fettweis, Gerhard ^{[1
]}

机构：

[1] Tech Univ Dresden, Vodafone Chair Mobile Commun Syst, D-01062 Dresden, Germany

来源：

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY | 2024年 / 5卷

关键词：

Resource management; Reliability; Machine learning algorithms; 3GPP; Reinforcement learning; Prediction algorithms; Power control; Difference of two convex functions (d.c.) programming; optimization; multi-agent reinforcement learning (MARL); platooning; radio resource management (RRM); federated learning (FL); PLATOON CONTROL; REINFORCEMENT; COMMUNICATION; MANAGEMENT;

D O I：

10.1109/OJCOMS.2024.3380509

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As one of the promising intelligent transportation frameworks, vehicular platooning has the potential to bring about sustainable and efficient mobility solutions. One of the challenges in the development of platooning is maintaining the string stability, which ensures that there is no amplification of the signal of interest along the platoon chain. String stability is dependent on reliable inter-vehicle communications and proper controller design. Therefore, in this paper, we formulate radio resource management (RRM) problem with the purpose of satisfying the reliability of the vehicle-to-vehicle (V2V) links and string stability of the platoon. We tackle the optimization problem from different angles. First, we devise centralized classical approaches based on difference of two convex functions (d.c.) programming, in which we assume the base station (BS) has full knowledge over the V2V channel gains. In the second strategy, we develop decentralized resource allocation approaches based on multi-agent reinforcement learning (MARL). In essence, we model each transmitter vehicle in the platoon as an autonomous agent that tries to find an optimal policy according to its local estimated information to maximize the total expected reward. We also investigate whether the integration of federated learning (FL) with decentralized MARL algorithms can bring any potential benefits. This comparison between classical and machine learning (ML)-based RRM strategies helps us make crucial observations in terms of robustness, sensitivity, and efficacy of the policies that are learned by reinforcement learning (RL)-based resource allocation algorithms.

引用

页码：1958 / 1974

页数：17

共 59 条

[1]

[Anonymous], 2019, Rep. TR 38.885

[2]

[Anonymous], 2022, Rep. TR 37.985

[3]

[Anonymous], 2022, Rep. TS 22.186

[4]

Boche H, 2024, Arxiv, DOI arXiv:2401.10310

[5]

Boyd S., 2004, Convex Optimization

[6]

Chen D, 2024, Arxiv, DOI arXiv:2308.02345

[7] Multi-Agent Reinforcement Learning for Slicing Resource Allocation in Vehicular Networks [J].

Cui, Yaping ;

Shi, Hongji ;

Wang, Ruyan ;

He, Peng ;

Wu, Dapeng ;

Huang, Xinyun .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) :2005-2016

[8]

Fujimoto S, 2018, PR MACH LEARN RES, V80

[9] A Tutorial on 5G NR V2X Communications [J].

Garcia, Mario H. Castaneda ;

Molina-Galan, Alejandro ;

Boban, Mate ;

Gozalvez, Javier ;

Coll-Perales, Baldomero ;

Sahin, Taylan ;

Kousaridas, Apostolos .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03) :1972-2026

[10]

Gonzalez A., 2022, Ph.D. dissertation

← 1 2 3 4 5 6 →