Cascade Reinforcement Learning with State Space Factorization for O-RAN-based Traffic Steering

被引:1
作者
Sun, Chuanneng [1 ]
Jung, Gueyoung [1 ]
Tran, Tuyen X. [1 ]
Pompili, Dario [1 ]
机构
[1] Rutgers Univ New Brunswick, Dept Elect & Comp Engn, New Brunswick, NJ 08901 USA
来源
2024 21ST ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON | 2024年
关键词
O-RAN; traffic steering; reinforcement learning;
D O I
10.1109/SECON64284.2024.10934854
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the Traffic Steering (TS) problem in Open Radio Access Network (O-RAN), leveraging its RAN Intelligent Controller (RIC), in which RAN configuration parameters of cells can be jointly and dynamically optimized in near-real-time. To address the TS problem, we propose a novel Cascade Reinforcement Learning (CaRL) framework, where we propose state space factorization and policy decomposition to mitigate the need for large complex models and well-labeled datasets. For each sub-state space, an RL sub-policy is trained to optimize the Quality of Service (QoS). To apply CaRL to new network areas, we propose a knowledge transfer approach to initialize a new sub-policy based on knowledge learned by the trained policies. To evaluate CaRL, we build a data-driven and scalable RIC Digital Twin (DT) that is modeled using real-world data, including network setup, user geo-distribution, and traffic demand, among others, from a tier-1 RAN operator. We evaluated CaRL in two DT scenarios representing two different US cities and compared its performance with business-as-usual policy as a baseline and other competing optimization approaches (i.e., heuristic and Q-table algorithms). Furthermore, we have conducted a field trial with the RAN operator to evaluate the performance of CaRL in two areas in the Northeast US regions.
引用
收藏
页数:9
相关论文
共 33 条
[1]   Q-Learning Based Intelligent Traffic Steering in Heterogeneous Network [J].
Adachi, Koichi ;
Li, Maodong ;
Tan, Peng Hui ;
Zhou, Yuan ;
Sun, Sumei .
2016 IEEE 83RD VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2016,
[2]  
Bin Peng X, 2019, Arxiv, DOI arXiv:1910.00177
[3]   Powder: Platform for Open Wireless Data-driven Experimental Research [J].
Breen, Joe ;
Buffmire, Andrew ;
Duerig, Jonathon ;
Dutt, Kevin ;
Eide, Eric ;
Hibler, Mike ;
Johnson, David ;
Kasera, Sneha Kumar ;
Lewis, Earl ;
Maas, Dustin ;
Orange, Alex ;
Patwari, Neal ;
Reading, Daniel ;
Ricci, Robert ;
Schurig, David ;
Stoller, Leigh B. ;
Van der Merwe, Jacobus ;
Webb, Kirk ;
Wong, Gary .
PROCEEDINGS OF THE FOURTEENTH ACM INTERNATIONAL WORKSHOP ON WIRELESS NETWORK TESTBEDS, EXPERIMENTAL EVALUATION & CHARACTERIZATION (WINTECH '20), 2020, :17-24
[4]   6G Wireless Communication Systems: Applications, Requirements, Technologies, Challenges, and Research Directions [J].
Chowdhury, Mostafa Zaman ;
Shahjalal, Md ;
Ahmed, Shakil ;
Jang, Yeong Min .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 :957-975
[5]  
Gomez-Miguelez Ismael., 2016, P 10 ACM INT WORKSHO, P25, DOI [10.1145/2980159.2980163, DOI 10.1145/2980159.2980163]
[6]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[7]   Vivisecting Mobility Management in 5G Cellular Networks [J].
Hassan, Ahmad ;
Narayanan, Arvind ;
Zhang, Anlan ;
Ye, Wei ;
Zhu, Ruiyang ;
Jin, Shuowei ;
Carpenter, Jason ;
Mao, Z. Morley ;
Qian, Feng ;
Zhang, Zhi-Li .
SIGCOMM '22: PROCEEDINGS OF THE 2022 ACM SIGCOMM 2022 CONFERENCE, 2022, :86-100
[8]  
Hui LF, 2016, Journal of Communications and Information Networks, V1, P77, DOI [10.1007/bf03391559, 10.11959/j.issn.2096-1081.2016.024]
[9]  
Jang E., 2017, CATEGORICAL REPARAME
[10]   The Road Towards 6G: A Comprehensive Survey [J].
Jiang, Wei ;
Han, Bin ;
Habibi, Mohammad Asif ;
Schotten, Hans Dieter .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 :334-366