Building a Digital Twin for network optimization using Graph Neural Networks

被引:43
作者
Ferriol-Galmes, Miquel [1 ]
Suarez-Varela, Jose [1 ]
Paillisse, Jordi [1 ]
Shi, Xiang [2 ]
Xiao, Shihan [2 ]
Cheng, Xiangle [2 ]
Barlet-Ros, Pere [1 ]
Cabellos-Aparicio, Albert [1 ]
机构
[1] Univ Politecn Cataluna, Barcelona Neural Networking Ctr, Catalunya, Spain
[2] Huawei Technol Co Ltd, Shenzhen, Peoples R China
关键词
Digital Twin; Graph Neural Networks; Network optimization; Deep Learning; Network modeling;
D O I
10.1016/j.comnet.2022.109329
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Network modeling is a critical component of Quality of Service (QoS) optimization. Current networks implement Service Level Agreements (SLA) by careful configuration of both routing and queue scheduling policies. However, existing modeling techniques are not able to produce accurate estimates of relevant SLA metrics, such as delay or jitter, in networks with complex QoS-aware queueing policies (e.g., strict priority, Weighted Fair Queueing, Deficit Round Robin). Recently, Graph Neural Networks (GNNs) have become a powerful tool to model networks since they are specifically designed to work with graph-structured data. In this paper, we propose a GNN-based network model able to understand the complex relationship between (i) the queueing policy (scheduling algorithm and queue sizes), (ii) the network topology, (iii) the routing configuration, and (iv) the input traffic matrix. We call our model TwinNet, a Digital Twin that can accurately estimate relevant SLA metrics for network optimization. TwinNet can generalize to its input parameters, operating successfully in topologies, routing, and queueing configurations never seen during training. We evaluate TwinNet over a wide variety of scenarios with synthetic traffic and validate it with real traffic traces. Our results show that TwinNet can provide accurate estimates of end-to-end path delays in 106 unseen real -world topologies, under different queuing configurations with a Mean Absolute Percentage Error (MAPE) of 3.8%, as well as a MAPE of 6.3% error when evaluated with a real testbed. We also showcase the potential of the proposed model for SLA-driven network optimization and what-if analysis.
引用
收藏
页数:14
相关论文
共 72 条
[1]   A roadmap for traffic engineering in SDN-OpenFlow networks [J].
Akyildiz, Ian F. ;
Lee, Ahyoung ;
Wang, Pu ;
Luo, Min ;
Chou, Wu .
COMPUTER NETWORKS, 2014, 71 :1-30
[2]  
Almasan J., 2020, Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case
[3]  
[Anonymous], 2001, Network Calculus: a Theory of Deterministic Queuing Systems for the Internet
[4]  
Autiosalo J, 2018, 2018 IEEE 4TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), P241, DOI 10.1109/WF-IoT.2018.8355217
[5]  
Barreto F, 2012, Arxiv, DOI arXiv:1204.2465
[6]  
Battaglia PW, 2016, ADV NEUR IN, V29
[7]   Is Machine Learning Ready for Traffic Engineering Optimization? [J].
Bernardez, Guillermo ;
Suarez-Varela, Jose ;
Lopez, Albert ;
Wu, Bo ;
Xiao, Shihan ;
Cheng, Xiangle ;
Barlet-Ros, Pere ;
Cabellos-Aparicio, Albert .
2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021,
[8]  
Bhatia Randeep, 2015, 2015 IEEE Conference on Computer Communications (INFOCOM). Proceedings, P657, DOI 10.1109/INFOCOM.2015.7218434
[9]  
Botchkarev A, 2018, Arxiv, DOI [arXiv:1809.03006, DOI 10.48550/ARXIV.1809.03006]
[10]   Stability and generalization [J].
Bousquet, O ;
Elisseeff, A .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :499-526