Policy-Gradient-Based Reinforcement Learning for Computing Resources Allocation in O-RAN

被引：9

作者：

Sharara, Mahdi ^{[1
]}

Pamuklu, Turgay ^{[2
]}

Hoteit, Sahar ^{[1
]}

Veque, Veronique ^{[1
]}

Erol-Kantarci, Melike ^{[2
]}

机构：

[1] Univ Paris Saclay, Lab Signaux & Syst, CNRS, Cent Supelec, Gif Sur Yvette, France

[2] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada

来源：

PROCEEDINGS OF THE 2022 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET 2022) | 2022年

关键词：

O-RAN; Integer Linear Programming; Reinforcement Learning; Computing Resources Allocation; 6G;

D O I：

10.1109/CloudNet55617.2022.9978863

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Open Radio Access Network (O-RAN) is a novel architecture aiming to disaggregate the network components to reduce capital and operational costs and open the interfaces to ensure interoperability. In this work, we consider the problem of allocating computing resources to process the data of enhanced Mobile BroadBand (eMBB) users and Ultra-Reliable Low-Latency (URLLC) Users. Supposing the processing of users' frames from different base stations is done in a shared O-Cloud, we model the computing resources allocation problem as an Integer Linear Programming (ILP) problem that aims at fairly allocating computing resources to eMBB and URLLC users and optimizing the QoS of URLLC users without neglecting eMBB users. Due to the high complexity of solving an ILP problem, we model the problem using Reinforcement Learning (RL). Our results demonstrate the ability of our RL-based solution to perform close to the ILP solver while having much lower computational complexity. For a different number of Open Radio Units (O-RUs), the objective value of the RL agent does not deviate from the ILP objective by more than 6%.

引用

页码：229 / 236

页数：8

共 50 条

[41] ScaRL: Service Function Chain Allocation Based on Reinforcement Learning in Mobile Edge Computing [J].

Jin, Qizhen ;

Ge, Shuxin ;

Zeng, Jiaxin ;

Zhou, Xiaobo ;

Qiu, Tie .

2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, :327-332

[42] Reinforcement learning based monotonic policy for online resource allocation [J].

Mishra, Pankaj ;

Moustafa, Ahmed .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 138 :313-327

[43] Energy Conserved Computation Offloading for O-RAN based IoT systems [J].

Wang, Liping ;

Zhou, Jianhong ;

Wang, Yunxiang ;

Lei, Boyi .

IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, :4043-4048

[44] A Federated Continual Learning Framework for Sustainable Network Anomaly Detection in O-RAN [J].

Benzaied, Chafika ;

Muhtasim Hossain, Fahim ;

Taleb, Tarik ;

Merino Gomez, Pedro ;

Dieudonne, Michael .

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,

[45] Explainable AI and Statistical Learning for Enhanced Abnormal Detection in O-RAN Networks [J].

Yao, Chih-Hao ;

Chen, Yu-An .

2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, :655-656

[46] Cascade Reinforcement Learning with State Space Factorization for O-RAN-based Traffic Steering [J].

Sun, Chuanneng ;

Jung, Gueyoung ;

Tran, Tuyen X. ;

Pompili, Dario .

2024 21ST ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON, 2024,

[47] Trustworthy Reputation for Federated Learning in O-RAN Using Blockchain and Smart Contracts [J].

Javed, Farhana ;

Mangues-Bafalluy, Josep ;

Zeydan, Engin ;

Blanco, Luis .

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2025, 6 :1343-1362

[48] Policy gradient fuzzy reinforcement learning [J].

Wang, XN ;

Xu, X ;

He, HG .

PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, :992-995

[49] Revealing the Threat Landscape of Intent-based Management in O-RAN [J].

Rebecchi, Filippo ;

Cho, Daniel ;

Abdelrazek, Loay ;

Forssell, Henrik ;

Olsson, Jonathan .

PROCEEDINGS OF THE 27TH CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS, ICIN, 2024, :106-113

[50] A survey of public datasets for O-RAN: fostering the development of machine learning models [J].

Couto, Rodrigo S. ;

Cruz, Pedro ;

Pacheco, Roberto G. ;

Souza, Vivian Maria S. ;

Campista, Miguel Elias M. ;

Costa, Luis Henrique M. K. .

ANNALS OF TELECOMMUNICATIONS, 2024, 79 (9-10) :649-662

← 1 2 3 4 5 →