A Routing Optimization Method for Software-Defined Optical Transport Networks Based on Ensembles and Reinforcement Learning

被引:13
作者
Chen, Junyan [1 ,2 ]
Xiao, Wei [1 ]
Li, Xinmei [1 ]
Zheng, Yang [3 ]
Huang, Xuefeng [1 ]
Huang, Danli [1 ]
Wang, Min [1 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
关键词
optical transport network; software-defined networking; deep Q-network; message-passing neural network; ensemble learning; ALGORITHM; QUALITY; COST; SDN; QOS;
D O I
10.3390/s22218139
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Optical transport networks (OTNs) are widely used in backbone- and metro-area transmission networks to increase network transmission capacity. In the OTN, it is particularly crucial to rationally allocate routes and maximize network capacities. By employing deep reinforcement learning (DRL)- and software-defined networking (SDN)-based solutions, the capacity of optical networks can be effectively increased. However, because most DRL-based routing optimization methods have low sample usage and difficulty in coping with sudden network connectivity changes, converging in software-defined OTN scenarios is challenging. Additionally, the generalization ability of these methods is weak. This paper proposes an ensembles- and message-passing neural-network-based Deep Q-Network (EMDQN) method for optical network routing optimization to address this problem. To effectively explore the environment and improve agent performance, the multiple EMDQN agents select actions based on the highest upper-confidence bounds. Furthermore, the EMDQN agent captures the network's spatial feature information using a message passing neural network (MPNN)-based DRL policy network, which enables the DRL agent to have generalization capability. The experimental results show that the EMDQN algorithm proposed in this paper performs better in terms of convergence. EMDQN effectively improves the throughput rate and link utilization of optical networks and has better generalization capabilities.
引用
收藏
页数:19
相关论文
共 34 条
[1]  
Agarwal R, 2020, PR MACH LEARN RES, V119
[2]  
Almasan J., 2020, Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case
[3]  
Che Xiangbei, 2021, Computer Engineering and Applications, V57, P93, DOI 10.3778/j.issn.1002-8331.2003-0423
[4]   Machine-Learning Based Routing Pre-plan for SDN [J].
Chen, Fengqing ;
Zheng, Xianghan .
MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, MIWAI 2015, 2015, 9426 :149-159
[5]   ALBLP: Adaptive Load-Balancing Architecture Based on Link-State Prediction in Software-Defined Networking [J].
Chen, Junyan ;
Wang, Yong ;
Huang, Xuefeng ;
Xie, Xiaolan ;
Zhang, Hongmei ;
Lu, Xiaoye .
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
[6]   Traffic Engineering With Equal-Cost-MultiPath: An Algorithmic Perspective [J].
Chiesa, Marco ;
Kindler, Guy ;
Schapira, Michael .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (02) :779-792
[7]   A4SDN-Adaptive Alienated Ant Algorithm for Software-Defined Networking [J].
Di Stefano, Antonella ;
Cammarata, Giovanni ;
Morana, Giovanni ;
Zito, Daniele .
2015 10TH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC), 2015, :344-350
[8]   A survey on ensemble learning [J].
Dong, Xibin ;
Yu, Zhiwen ;
Cao, Wenming ;
Shi, Yifan ;
Ma, Qianli .
FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (02) :241-258
[9]  
Fang C., 2010, P 19 ANN WIRELESS OP, P1
[10]   Target-driven visual navigation in indoor scenes using reinforcement learning and imitation learning [J].
Fang, Qiang ;
Xu, Xin ;
Wang, Xitong ;
Zeng, Yujun .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (02) :167-176