RAN Information-Assisted TCP Congestion Control Using Deep Reinforcement Learning With Reward Redistribution

Cited by: 4

Authors:

Chen, Minghao [1]

Li, Rongpeng [1]

Crowcroft, Jon [2]

Wu, Jianjun [3]

Zhao, Zhifeng [4]

Zhang, Honggang [1]

Affiliations:

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

[2] Univ Cambridge, Dept Comp Sci, Cambridge CB2 1TN, England

[3] Huawei Technol Co Ltd, Shanghai 201206, Peoples R China

[4] Zhejiang Lab, Hangzhou 311121, Peoples R China

Source:

IEEE TRANSACTIONS ON COMMUNICATIONS | 2022 / Vol. 70 / No. 01

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

Reinforcement learning; Servers; Internet; Throughput; Radio access networks; Bandwidth; 5G mobile communication; Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;

DOI:

10.1109/TCOMM.2021.3123130

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

In this paper, we propose a novel transmission control protocol (TCP) congestion control method from a cross-layer perspective: a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution), designed to learn the TCP congestion control policy more effectively. In particular, we incorporate radio access network (RAN) information to track RAN dynamics in a timely manner, and enable DRL to learn from delayed RAN feedback that may be induced by several consecutive actions. Meanwhile, we relax the implicit assumption in previous research that the feedback to a specific action returns one round-trip time (RTT) after the action is applied, by redistributing the rewards so that the merits of actions are evaluated more accurately. Experimental results show that, while maintaining reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, BBR, and DRL-CC) on network utility, achieving higher throughput while reducing delay in various network environments.


Pages: 215-230

Page count: 16


