Policy-Gradient-Based Reinforcement Learning for Computing Resources Allocation in O-RAN

被引：8

作者：

Sharara, Mahdi ^{[1
]}

Pamuklu, Turgay ^{[2
]}

Hoteit, Sahar ^{[1
]}

Veque, Veronique ^{[1
]}

Erol-Kantarci, Melike ^{[2
]}

机构：

[1] Univ Paris Saclay, Lab Signaux & Syst, CNRS, Cent Supelec, Gif Sur Yvette, France

[2] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada

来源：

PROCEEDINGS OF THE 2022 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET 2022) | 2022年

关键词：

O-RAN; Integer Linear Programming; Reinforcement Learning; Computing Resources Allocation; 6G;

D O I：

10.1109/CloudNet55617.2022.9978863

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Open Radio Access Network (O-RAN) is a novel architecture aiming to disaggregate the network components to reduce capital and operational costs and open the interfaces to ensure interoperability. In this work, we consider the problem of allocating computing resources to process the data of enhanced Mobile BroadBand (eMBB) users and Ultra-Reliable Low-Latency (URLLC) Users. Supposing the processing of users' frames from different base stations is done in a shared O-Cloud, we model the computing resources allocation problem as an Integer Linear Programming (ILP) problem that aims at fairly allocating computing resources to eMBB and URLLC users and optimizing the QoS of URLLC users without neglecting eMBB users. Due to the high complexity of solving an ILP problem, we model the problem using Reinforcement Learning (RL). Our results demonstrate the ability of our RL-based solution to perform close to the ILP solver while having much lower computational complexity. For a different number of Open Radio Units (O-RUs), the objective value of the RL agent does not deviate from the ILP objective by more than 6%.

引用

页码：229 / 236

页数：8

共 24 条

[1]

[Anonymous], 2020, SLIC ARCH, P1

[2]

[Anonymous], 2018, TS138214V1530 ETSI

[3]

[Anonymous], 2019, CLOUD ARCH DEPL SCEN, P1

[4]

[Anonymous], 2021, ARCH DESCR, P1

[5] On some properties of the proportional fair scheduling policy [J].

Avidor, D ;

Mukherjee, S ;

Ling, J ;

Papadias, C .

2004 IEEE 15TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, VOLS 1-4, PROCEEDINGS, 2004, :853-858

[6] Intelligence and Learning in O-RAN for Data-Driven NextG Cellular Networks [J].

Bonati, Leonardo ;

D'Oro, Salvatore ;

Polese, Michele ;

Basagni, Stefano ;

Melodia, Tommaso .

IEEE COMMUNICATIONS MAGAZINE, 2021, 59 (10) :21-27

[7] 6G Wireless Communication Systems: Applications, Requirements, Technologies, Challenges, and Research Directions [J].

Chowdhury, Mostafa Zaman ;

Shahjalal, Md ;

Ahmed, Shakil ;

Jang, Yeong Min .

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 :957-975

[8]

D'Oro S, 2022, Arxiv, DOI arXiv:2203.02370

[9]

DOro S., 2022, arXiv

[10]

Elsayed M, 2019, 2019 IEEE 2ND 5G WORLD FORUM (5GWF), P590, DOI [10.1109/5GWF.2019.8911618, 10.1109/5gwf.2019.8911618]

← 1 2 3 →