Routing Optimization With Deep Reinforcement Learning in Knowledge Defined Networking

被引:32
作者
He, Qiang [1 ]
Wang, Yu [2 ]
Wang, Xingwei [2 ]
Xu, Weiqiang [3 ]
Li, Fuliang [2 ]
Yang, Kaiqi [2 ]
Ma, Lianbo [4 ]
机构
[1] Northeastern Univ, Coll Med & Biol Informat Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Comp Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[3] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou 310023, Zhejiang, Peoples R China
[4] Northeastern Univ, Coll Software Engn, Shenyang 110819, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; graph neural network; knowledge defined networking; routing optimization; GRAPH NEURAL-NETWORK;
D O I
10.1109/TMC.2023.3235446
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional routing algorithms cannot dynamically change network environments due to the limited information for routing decisions. Meanwhile, they are prone to performance bottlenecks in the face of increasingly complex business requirements. Some approaches, such as deep reinforcement learning (DRL) have been proposed to address the routing problems. However, they hardly utilize the information about the network environment fully. The Knowledge Defined Networking (KDN) architecture inspires us to develop new learning mechanisms adapted to the dynamic characteristics of the network topology. In this paper, we propose an effective scheme to solve the routing optimization problem by adding a graph neural network (GNN) structure to DRL, called Message Passing Deep Reinforcement Learning (MPDRL). MPDRL uses the characteristics of GNN to interact with the network topology environment and extracts exploitable knowledge through the message passing process of information between links in the topology. The goal is to achieve the load balance of network traffic and improve network performance. We have conducted experiments on three Internet Service Provider (ISP) network topologies. The evaluation results show that MPDRL obtains better network performance than the baseline algorithms.
引用
收藏
页码:1444 / 1455
页数:12
相关论文
共 44 条
  • [1] Abadi M, 2016, Arxiv, DOI arXiv:1605.08695
  • [2] Almasan P., 2020, arXiv
  • [3] [Anonymous], 1994, Adv. Neural Inf. Process. Syst
  • [4] Optimal oblivious routing in polynomial time
    Azar, Y
    Cohen, E
    Fiat, A
    Kaplan, H
    Räcke, H
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2004, 69 (03) : 383 - 394
  • [5] Intelligent Routing Based on Reinforcement Learning for Software-Defined Networking
    Casas-Velasco, Daniela M.
    Rendon, Oscar Mauricio Caicedo
    da Fonseca, Nelson L. S.
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (01): : 870 - 881
  • [6] RL-Routing: An SDN Routing Algorithm Based on Deep Reinforcement Learning
    Chen, Yi-Ren
    Rezapour, Amir
    Tzeng, Wen-Guey
    Tsai, Shi-Chun
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 3185 - 3199
  • [7] Chung JY, 2014, Arxiv, DOI arXiv:1412.3555
  • [8] Clark DD, 2003, ACM SIGCOMM COMP COM, V33, P3
  • [9] Cui Yong, 2002, Journal of Software, V13, P2065
  • [10] Geyer F., 2018, P 2018 WORKSH BIG DA, P40, DOI DOI 10.1145/3229607.3229610