RAN Information-Assisted TCP Congestion Control Using Deep Reinforcement Learning With Reward Redistribution

被引：5

作者：

Chen, Minghao ^{[1
]}

Li, Rongpeng ^{[1
]}

Crowcroft, Jon ^{[2
]}

Wu, Jianjun ^{[3
]}

Zhao, Zhifeng ^{[4
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

[2] Univ Cambridge, Dept Comp Sci, Cambridge CB2 1TN, England

[3] Huawei Technol Co Ltd, Shanghai 201206, Peoples R China

[4] Zhejiang Lab, Hangzhou 311121, Peoples R China

来源：

IEEE TRANSACTIONS ON COMMUNICATIONS | 2022年 / 70卷 / 01期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Reinforcement learning; Servers; Internet; Throughput; Radio access networks; Bandwidth; 5G mobile communication; Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;

D O I：

10.1109/TCOMM.2021.3123130

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we aim to propose a novel transmission control protocol (TCP) congestion control method from a cross-layer-based perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) so as to learn the TCP congestion control policy in a superior manner. In particular, we incorporate the RAN information to timely grasp the dynamics of RAN, and empower DRL to learn from the delayed RAN information feedback potentially induced by several consecutive actions. Meanwhile, we relax the implicit assumption (that the feedback to one specific action returns at a round-trip-time (RTT) after the action is applied) in previous researches, by redistributing the rewards and evaluating the merits of actions more accurately. Experiment results show that besides maintaining a reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, BBR and DRL-CC) on network utility by achieving a higher throughput while reducing delay in various network environments.

引用

页码：215 / 230

页数：16

共 50 条

[21] Experience-Driven Congestion Control: When Multi-Path TCP Meets Deep Reinforcement Learning [J].

Xu, Zhiyuan ;

Tang, Jian ;

Yin, Chengxiang ;

Wang, Yanzhi ;

Xue, Guoliang .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) :1325-1336

[22] Deep Reinforcement Learning for Optimization of RAN Slicing Relying on Control- and User-Plane Separation [J].

Tu, Haiyan ;

Zhao, Liqiang ;

Zhang, Yaoyuan ;

Zheng, Gan ;

Feng, Chen ;

Song, Shenghui ;

Liang, Kai .

IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05) :8485-8498

[23] SmartCC: A Reinforcement Learning Approach for Multipath TCP Congestion Control in Heterogeneous Networks [J].

Li, Wenzhong ;

Zhang, Han ;

Gao, Shaohua ;

Xue, Chaojing ;

Wang, Xiaoliang ;

Lu, Sanglu .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2621-2633

[24] Modelling and analysis of TCP congestion control mechanisms using stochastic reward nets [J].

Younes, Osama S. .

INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2019, 10 (04) :390-412

[25] DeepCC: Multi-Agent Deep Reinforcement Learning Congestion Control for Multi-Path TCP Based on Self-Attention [J].

He, Bo ;

Wang, Jingyu ;

Qi, Qi ;

Sun, Haifeng ;

Liao, Jianxin ;

Du, Chunning ;

Yang, Xiang ;

Han, Zhu .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (04) :4770-4788

[26] ICRAN: Intelligent Control for Self-Driving RAN Based on Deep Reinforcement Learning [J].

Ahmed, Azza H. ;

Elmokashfi, Ahmed .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (03) :2751-2766

[27] Ablation Study of Deep Reinforcement Learning Congestion Control in Cellular Network Settings [J].

Naqvi, Haidlir ;

Anggorojati, Bayu .

2022 25TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2022,

[28] A cache-aware congestion control mechanism using deep reinforcement learning for wireless sensor networks [J].

Alipio, Melchizedek ;

Bures, Miroslav .

AD HOC NETWORKS, 2025, 166

[29] Implementability improvement of deep reinforcement learning based congestion control in cellular network [J].

Naqvi, Haidlir Achmad ;

Hilman, Muhammad Hafizhuddin ;

Anggorojati, Bayu .

COMPUTER NETWORKS, 2023, 233

[30] Design and Performance Evaluation of Enhanced Congestion Control Algorithm for Wireless TCP by using a Deep Learning [J].

Han, Kimoon ;

Hwang, Ankyu ;

Lee, Jae Yong ;

Kim, Byung Chul .

2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, :467-468

← 1 2 3 4 5 →