On Deep Reinforcement Learning for Static Routing and Wavelength Assignment

Cited by: 26

Authors:

Di Cicco, Nicola [1]

Mercan, Emre Furkan [1]

Karandin, Oleg [1]

Ayoub, Omran [2]

Troia, Sebastian [1]

Musumeci, Francesco [1]

Tornatore, Massimo [1]

Affiliations:

[1] Politecn Milan, Dept Elect Informat & Bioengn DEIB, I-20133 Milan, Italy

[2] Univ Appl Sci Southern Switzerland, CH-6928 Manno, Switzerland

Source:

IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS | 2022 / Vol. 28 / No. 04

Keywords:

Training; Routing; Heuristic algorithms; Reinforcement learning; Wavelength assignment; Topology; Network topology; Deep reinforcement learning; genetic algorithm; optimization; routing and wavelength assignment; NETWORKS;

DOI:

10.1109/JSTQE.2022.3151323

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

Deep Reinforcement Learning (DRL) is rising as a promising tool for solving optimization problems in optical networks. Though studies employing DRL for solving static optimization problems in optical networks are appearing, assessing strengths and weaknesses of DRL with respect to state-of-the-art solution methods is still an open research question. In this work, we focus on Routing and Wavelength Assignment (RWA), a well-studied problem for which fast and scalable algorithms leading to better optimality gaps are always sought for. We develop two different DRL-based methods to assess the impact of different design choices on DRL performance. In addition, we propose a Multi-Start approach that can improve the average DRL performance, and we engineer a shaped reward that allows efficient learning in networks with high link capacities. With Multi-Start, DRL gets competitive results with respect to a state-of-the-art Genetic Algorithm with significant savings in computational times. Moreover, we assess the generalization capabilities of DRL to traffic matrices unseen during training, in terms of total connection requests and traffic distribution, showing that DRL can generalize on small to moderate deviations with respect to the training traffic matrices. Finally, we assess DRL scalability with respect to topology size and link capacity.


Pages: 12

References: 46 in total

[1] Almasan P., 2019, DEEP REINFORCEMENT L

[2] Andrychowicz M., 2021, P INT C LEARN REPR O, P2021

[3] Antuori Valentin, 2020, Principles and Practice of Constraint Programming. 26th International Conference, CP 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12333), P657, DOI 10.1007/978-3-030-58475-7_38

[4] Machine learning for combinatorial optimization: A methodological tour d'horizon [J]. Bengio, Yoshua; Lodi, Andrea; Prouvost, Antoine. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 290 (02): 405-421

[5] Çetinkaya EK, 2013, INT C ULTRA MOD TELE, P38, DOI 10.1109/ICUMT.2013.6798402

[6] A Multi-Task-Learning-Based Transfer Deep Reinforcement Learning Design for Autonomic Optical Networks [J]. Chen, Xiaoliang; Proietti, Roberto; Liu, Che-Yu; Yoo, S. J. Ben. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (09): 2878-2889

[7] Building Autonomic Elastic Optical Networks with Deep Reinforcement Learning [J]. Chen, Xiaoliang; Proietti, Roberto; Yoo, S. J. Ben. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (10): 20-26

[8] DeepRMSA: A Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment in Elastic Optical Networks [J]. Chen, Xiaoliang; Li, Baojia; Proietti, Roberto; Lu, Hongbo; Zhu, Zuqing; Yoo, S. J. Ben. JOURNAL OF LIGHTWAVE TECHNOLOGY, 2019, 37 (16): 4155-4163

[9] LIGHTPATH COMMUNICATIONS - AN APPROACH TO HIGH BANDWIDTH OPTICAL WANS [J]. CHLAMTAC, I; GANZ, A; KARMI, G. IEEE TRANSACTIONS ON COMMUNICATIONS, 1992, 40 (07): 1171-1182

[10] Offline Routing and Wavelength Assignment in Transparent WDM Networks [J]. Christodoulopoulos, Konstantinos; Manousakis, Konstantinos; Varvarigos, Emmanouel. IEEE-ACM TRANSACTIONS ON NETWORKING, 2010, 18 (05): 1557-1570
