Risk-Aware Reinforcement Learning Framework for User-Centric O-RAN

Cited by: 0

Authors:

Kasi, Shahrukh Khan [1]

Khan, Fahd Ahmed [1]

Ekin, Sabit [2]

Imran, Ali [1,3]

Affiliations:

[1] Univ Oklahoma, AI4Networks Res Ctr, Sch Elect & Comp Engn, Norman, OK 73019 USA

[2] Texas A&M Univ, Sch Elect & Comp Engn, College Stn, TX 77840 USA

[3] Univ Glasgow, James Watt Sch Engn, Glasgow G12 8QQ, Scotland

Source:

IEEE TRANSACTIONS ON MACHINE LEARNING IN COMMUNICATIONS AND NETWORKING | 2025 / Vol. 3

Funding:

U.S. National Science Foundation;

Keywords:

Optimization; Open RAN; Training; Cellular networks; Computer architecture; Reliability; Convergence; Resource management; Energy efficiency; Quality of service; User-centric; O-RAN; reinforcement learning; risk-aware; 6G and beyond;

DOI:

10.1109/TMLCN.2025.3534139

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The evolution of Open Radio Access Networks (O-RAN) presents an opportunity to enhance network performance by enabling dynamic orchestration of configuration and optimization parameters (COPs) through online learning methods. However, leveraging this potential requires overcoming the limitations of traditional cell-centric RAN architectures, which lack the necessary flexibility. On the other hand, despite their recent popularity, the practical deployment of online learning frameworks, such as Deep Reinforcement Learning (DRL)-based COP optimization solutions, remains limited due to their risk of deteriorating network performance during the exploration phase. In this article, we propose and analyze a novel risk-aware DRL framework for user-centric RAN (UC-RAN), which offers both the architectural flexibility and COP optimization to exploit this flexibility. We investigate and identify UC-RAN COPs that can be optimized via a soft actor-critic algorithm implementable as an O-RAN application (rApp) to jointly maximize latency satisfaction, reliability satisfaction, area spectral efficiency, and energy efficiency. We use offline learning on UC-RAN to reliably accelerate DRL training, thus minimizing the risk of DRL deteriorating cellular network performance. Results show that our proposed solution approaches near-optimal performance in just a few hundred iterations with a decrease in risk score by a factor of ten.


Pages: 195-214

Page count: 20

References: 38 in total

[1] Autonomous Helicopter Aerobatics through Apprenticeship Learning [J]. Abbeel, Pieter; Coates, Adam; Ng, Andrew Y. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (13): 1608-1639

[2] Achiam J, 2017, PR MACH LEARN RES, V70

[3] A Survey of Self Organisation in Future Cellular Networks [J]. Aliu, Osianoh Glenn; Imran, Ali; Imran, Muhammad Ali; Evans, Barry. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2013, 15 (01): 336-361

[4] Amodei D, 2016, Arxiv, DOI [arXiv:1606.06565, 10.48550/arXiv.1606.06565]

[5] Latency-Aware Near-Real-Time RIC Deployment in User-Centric RAN With Cell-Free Massive MIMO: A Telecom Operator Perspective [J]. Amrallah, Amr; Murakami, Takahide; Tsukamoto, Yu; Ikami, Akio; Shinbo, Hiroyuki; Amano, Yoshiaki. 2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024

[6] [Anonymous], 2024, Service Requirements for the 5G System, document TS 22.261

[7] [Anonymous], 2024, Study on Channel Model for Frequencies From 0.5 to 100 GHz, document (TR) 38.901

[8] User Centric Cell-Free Massive MIMO in the O-RAN Architecture: Signalling and Algorithm Integration [J]. Beerten, Robbert; Girycki, Adam; Pollin, Sofie. 2022 IEEE CONFERENCE ON STANDARDS FOR COMMUNICATIONS AND NETWORKING, CSCN, 2022: 181-187

[9] Powder: Platform for Open Wireless Data-driven Experimental Research [J]. Breen, Joe; Buffmire, Andrew; Duerig, Jonathon; Dutt, Kevin; Eide, Eric; Hibler, Mike; Johnson, David; Kasera, Sneha Kumar; Lewis, Earl; Maas, Dustin; Orange, Alex; Patwari, Neal; Reading, Daniel; Ricci, Robert; Schurig, David; Stoller, Leigh B.; Van der Merwe, Jacobus; Webb, Kirk; Wong, Gary. PROCEEDINGS OF THE FOURTEENTH ACM INTERNATIONAL WORKSHOP ON WIRELESS NETWORK TESTBEDS, EXPERIMENTAL EVALUATION & CHARACTERIZATION (WINTECH '20), 2020: 17-24

[10] Forsk, Forsk Atoll
