Q-Learning Based Routing Protocol for Congestion Avoidance

Citations: 7

Authors:

Godfrey, Daniel [1]

Kim, Beom-Su [1]

Miao, Haoran [1]

Shah, Babar [2]

Hayat, Bashir [3]

Khan, Imran [4]

Sung, Tae-Eung [5]

Kim, Ki-Il [1]

Affiliations:

[1] Chungnam Natl Univ, Dept Comp Sci & Engn, Daejeon, South Korea

[2] Zayed Univ, Coll Technol Innovat, Abu Dhabi, U Arab Emirates

[3] Inst Management Sci, Peshawar, Pakistan

[4] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan

[5] Yonsei Univ, Dept Comp & Telecommun Engn, Seoul, South Korea

Source:

CMC-COMPUTERS MATERIALS & CONTINUA | 2021 / Vol. 68 / Issue 03

Keywords:

Congestion-aware routing; reinforcement learning; Q-learning; Software defined networks; FAILURE RECOVERY; SOFTWARE; SDN;

DOI:

10.32604/cmc.2021.017475

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The end-to-end delay in a wired network depends strongly on congestion at intermediate nodes. Among the many feasible approaches to avoiding congestion, congestion-aware routing protocols typically search for an uncongested path toward the destination using rule-based approaches that are reactive (incident-driven) and distributed. However, these approaches struggle to adapt dynamically to changing network environments in an autonomous, self-adaptive manner. To overcome this drawback, we present a new congestion-aware routing protocol based on a Q-learning algorithm for software-defined networks, where logically centralized network operation enables intelligent control and management of network resources. In the proposed routing protocol, either one of the uncongested neighboring nodes is randomly selected as the next hop to distribute the traffic load over multiple paths, or the Q-learning algorithm is applied to decide the next hop by modeling the state, Q-value, and reward function to set the desired path toward the destination. A new reward function that combines buffer occupancy, link reliability, and hop count is considered. Moreover, a look-ahead algorithm is employed to update the Q-value with values within two hops simultaneously. This approach selects the optimal next hop by taking the congestion status within two hops into account. Finally, the simulation results show an approximately 20% higher packet delivery ratio and a 15% shorter end-to-end delay than the existing scheme, achieved by avoiding congestion adaptively.
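
The next-hop decision described in the abstract can be sketched roughly as follows. This is only an illustrative reading, not the authors' implementation: the abstract gives no formulas, so the reward weights, the learning rate, the discount factor, the congestion threshold, and all function names here are hypothetical. The update shown is the standard Q-learning rule, whose bootstrap term already inspects the next hop's own neighbors; the paper's look-ahead rule may differ. The sketch also assumes that random selection is used whenever at least one uncongested neighbor exists, which the abstract does not state.

```python
import random

# Hypothetical parameters; the abstract does not specify any of these values.
W_BUFFER, W_RELIABILITY, W_HOPS = 0.5, 0.3, 0.2
ALPHA, GAMMA = 0.5, 0.9          # learning rate and discount factor
CONGESTION_THRESHOLD = 0.8       # buffer occupancy above this counts as congested


def reward(buffer_occupancy, link_reliability, hop_count, max_hops):
    """Assumed reward: favors emptier buffers, reliable links, and fewer hops."""
    return (W_BUFFER * (1.0 - buffer_occupancy)
            + W_RELIABILITY * link_reliability
            + W_HOPS * (1.0 - hop_count / max_hops))


def choose_next_hop(q, node, neighbors, occupancy, rng=random):
    """Pick a random uncongested neighbor if any exists, else the best Q-value."""
    candidates = neighbors[node]
    uncongested = [n for n in candidates if occupancy[n] < CONGESTION_THRESHOLD]
    if uncongested:
        return rng.choice(uncongested)   # spreads load over multiple paths
    return max(candidates, key=lambda n: q.get((node, n), 0.0))


def lookahead_update(q, node, next_hop, r, neighbors):
    """Standard Q-learning update using the best value one hop beyond next_hop."""
    beyond = neighbors.get(next_hop, [])
    best_beyond = max((q.get((next_hop, n), 0.0) for n in beyond), default=0.0)
    old = q.get((node, next_hop), 0.0)
    q[(node, next_hop)] = old + ALPHA * (r + GAMMA * best_beyond - old)
    return q[(node, next_hop)]
```

In a software-defined network, a table like `q` would plausibly live in the logically centralized controller, which can observe buffer occupancy across nodes; that placement is likewise an assumption of this sketch.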

Pages: 3671-3692

Page count: 22
