Joint Routing and Scheduling Optimization in Time-Sensitive Networks Using Graph-Convolutional-Network-Based Deep Reinforcement Learning

被引:42
作者
Yang, Liu [1 ]
Wei, Yifei [1 ]
Yu, F. Richard [2 ]
Han, Zhu [3 ,4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[2] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[3] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
[4] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning (DRL); graph convolutional network (GCN); joint routing and scheduling; time-sensitive networking (TSN); worst case end-to-end latency;
D O I
10.1109/JIOT.2022.3188826
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing number of Internet of Things (IoT) devices brings enormous time-sensitive applications, which require real-time transmission to effectuate communication services. The ultrareliable and low-latency communication (URLLC) scenario in the fifth generation (5G) has played a critical role in supporting services with delay-sensitive properties. Time-sensitive networking (TSN) has been widely considered as a promising paradigm for enabling the deterministic transmission guarantees for 5G. However, TSN is a hybrid traffic system with time-sensitive traffic and best effort traffic, which require effective routing and scheduling to provide a deterministic and bounded delay. While joint optimization of time-sensitive and non-time-sensitive traffic greatly increases the solution space and brings a significant challenge to obtain solutions. Therefore, this article proposes a graph convolutional network-based deep reinforcement learning (GCN-based DRL) solution for the joint optimization problem in practical communication scenarios. The GCN is integrated into deep reinforcement learning (DRL) to obtain the network's spatial dependence and elevate the generalization performance of the proposed method. Specifically, the GCN adopts the first-order Chebyshev polynomial to approximate the graph convolution kernel, which reduces the complexity of the algorithm and improves the feasibility for the joint optimization task. Furthermore, priority experience replay is employed to accelerate the convergence speed of the model training process. Numerical simulations demonstrate that the proposed GCN-based DRL algorithm has good convergence and outperforms the benchmark methods in terms of the average end-to-end delay.
引用
收藏
页码:23981 / 23994
页数:14
相关论文
共 46 条
[1]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[2]   Routing and Scheduling of Time-Triggered Traffic in Time-Sensitive Networks [J].
Atallah, Ayman A. ;
Hamad, Ghaith Bany ;
Mohamed, Otmane Ait .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (07) :4525-4534
[3]  
Avranas A., 2020, arXiv
[4]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[5]   A GNN-Based Supervised Learning Framework for Resource Allocation in Wireless IoT Networks [J].
Chen, Tianrui ;
Zhang, Xinruo ;
You, Minglei ;
Zheng, Gan ;
Lambotharan, Sangarapillai .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03) :1712-1724
[6]   EFFICIENT POWER ALLOCATION USING GRAPH NEURAL NETWORKS AND DEEP ALGORITHM UNFOLDING [J].
Chowdhury, Arindam ;
Verma, Gunjan ;
Rao, Chirag ;
Swami, Ananthram ;
Segarra, Santiago .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :4725-4729
[7]   Unfolding WMMSE Using Graph Neural Networks for Efficient Power Allocation [J].
Chowdhury, Arindam ;
Verma, Gunjan ;
Rao, Chirag ;
Swami, Ananthram ;
Segarra, Santiago .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (09) :6004-6017
[8]  
Chuang CC, 2020, IEEE INT C EMERG, P254, DOI 10.1109/ETFA46521.2020.9211969
[9]   Scheduling Real-Time Communication in IEEE 802.1Qbv Time Sensitive Networks [J].
Craciunas, Silviu S. ;
Oliver, Ramon Serna ;
Chmelik, Martin ;
Steiner, Wilfried .
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON REAL-TIME NETWORKS AND SYSTEMS PROCEEDINGS (RTNS 2016), 2016, :183-192
[10]   No-wait Packet Scheduling for IEEE Time-sensitive Networks (TSN) [J].
Duerr, Frank ;
Nayak, Naresh Ganesh .
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON REAL-TIME NETWORKS AND SYSTEMS PROCEEDINGS (RTNS 2016), 2016, :203-212