Cascade Reinforcement Learning with State Space Factorization for O-RAN-based Traffic Steering

被引：1

作者：

Sun, Chuanneng ^{[1
]}

Jung, Gueyoung ^{[1
]}

Tran, Tuyen X. ^{[1
]}

Pompili, Dario ^{[1
]}

机构：

[1] Rutgers Univ New Brunswick, Dept Elect & Comp Engn, New Brunswick, NJ 08901 USA

来源：

2024 21ST ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON | 2024年

关键词：

O-RAN; traffic steering; reinforcement learning;

D O I：

10.1109/SECON64284.2024.10934854

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We study the Traffic Steering (TS) problem in Open Radio Access Network (O-RAN), leveraging its RAN Intelligent Controller (RIC), in which RAN configuration parameters of cells can be jointly and dynamically optimized in near-real-time. To address the TS problem, we propose a novel Cascade Reinforcement Learning (CaRL) framework, where we propose state space factorization and policy decomposition to mitigate the need for large complex models and well-labeled datasets. For each sub-state space, an RL sub-policy is trained to optimize the Quality of Service (QoS). To apply CaRL to new network areas, we propose a knowledge transfer approach to initialize a new sub-policy based on knowledge learned by the trained policies. To evaluate CaRL, we build a data-driven and scalable RIC Digital Twin (DT) that is modeled using real-world data, including network setup, user geo-distribution, and traffic demand, among others, from a tier-1 RAN operator. We evaluated CaRL in two DT scenarios representing two different US cities and compared its performance with business-as-usual policy as a baseline and other competing optimization approaches (i.e., heuristic and Q-table algorithms). Furthermore, we have conducted a field trial with the RAN operator to evaluate the performance of CaRL in two areas in the Northeast US regions.

引用

页数：9

共 33 条

[1] Q-Learning Based Intelligent Traffic Steering in Heterogeneous Network [J].

Adachi, Koichi ;

Li, Maodong ;

Tan, Peng Hui ;

Zhou, Yuan ;

Sun, Sumei .

2016 IEEE 83RD VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2016,

[2]

Bin Peng X, 2019, Arxiv, DOI arXiv:1910.00177

[3] Powder: Platform for Open Wireless Data-driven Experimental Research [J].

Breen, Joe ;

Buffmire, Andrew ;

Duerig, Jonathon ;

Dutt, Kevin ;

Eide, Eric ;

Hibler, Mike ;

Johnson, David ;

Kasera, Sneha Kumar ;

Lewis, Earl ;

Maas, Dustin ;

Orange, Alex ;

Patwari, Neal ;

Reading, Daniel ;

Ricci, Robert ;

Schurig, David ;

Stoller, Leigh B. ;

Van der Merwe, Jacobus ;

Webb, Kirk ;

Wong, Gary .

PROCEEDINGS OF THE FOURTEENTH ACM INTERNATIONAL WORKSHOP ON WIRELESS NETWORK TESTBEDS, EXPERIMENTAL EVALUATION & CHARACTERIZATION (WINTECH '20), 2020, :17-24

[4] 6G Wireless Communication Systems: Applications, Requirements, Technologies, Challenges, and Research Directions [J].

Chowdhury, Mostafa Zaman ;

Shahjalal, Md ;

Ahmed, Shakil ;

Jang, Yeong Min .

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 :957-975

[5]

Gomez-Miguelez Ismael., 2016, P 10 ACM INT WORKSHO, P25, DOI [10.1145/2980159.2980163, DOI 10.1145/2980159.2980163]

[6]

Haarnoja T, 2018, PR MACH LEARN RES, V80

[7] Vivisecting Mobility Management in 5G Cellular Networks [J].

Hassan, Ahmad ;

Narayanan, Arvind ;

Zhang, Anlan ;

Ye, Wei ;

Zhu, Ruiyang ;

Jin, Shuowei ;

Carpenter, Jason ;

Mao, Z. Morley ;

Qian, Feng ;

Zhang, Zhi-Li .

SIGCOMM '22: PROCEEDINGS OF THE 2022 ACM SIGCOMM 2022 CONFERENCE, 2022, :86-100

[8]

Hui LF, 2016, Journal of Communications and Information Networks, V1, P77, DOI [10.1007/bf03391559, 10.11959/j.issn.2096-1081.2016.024]

[9]

Jang E., 2017, CATEGORICAL REPARAME

[10] The Road Towards 6G: A Comprehensive Survey [J].

Jiang, Wei ;

Han, Bin ;

Habibi, Mohammad Asif ;

Schotten, Hans Dieter .

IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 :334-366

← 1 2 3 4 →