Q-Learning Based Routing Protocol for Congestion Avoidance

被引:7
|
作者
Godfrey, Daniel [1 ]
Kim, Beom-Su [1 ]
Miao, Haoran [1 ]
Shah, Babar [2 ]
Hayat, Bashir [3 ]
Khan, Imran [4 ]
Sung, Tae-Eung [5 ]
Kim, Ki-Il [1 ]
机构
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, Daejeon, South Korea
[2] Zayed Univ, Coll Technol Innovat, Abu Dhabi, U Arab Emirates
[3] Inst Management Sci, Peshawar, Pakistan
[4] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[5] Yonsei Univ, Dept Comp & Telecommun Engn, Seoul, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 68卷 / 03期
关键词
Congestion-aware routing; reinforcement learning; Q-learning; Software defined networks; FAILURE RECOVERY; SOFTWARE; SDN;
D O I
10.32604/cmc.2021.017475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The end-to-end delay in a wired network is strongly dependent on congestion on intermediate nodes. Among lots of feasible approaches to avoid congestion efficiently, congestion-aware routing protocols tend to search for an uncongested path toward the destination through rule-based approaches in reactive/incident-driven and distributed methods. However, these previous approaches have a problem accommodating the changing network environments in autonomous and self-adaptive operations dynamically. To overcome this drawback, we present a new congestion-aware routing protocol based on a Q-learning algorithm in software-defined networks where logically centralized network operation enables intelligent control and management of network resources. In a proposed routing protocol, either one of uncongested neighboring nodes are randomly selected as next hop to distribute traffic load to multiple paths or Q-learning algorithm is applied to decide the next hop by modeling the state, Q-value, and reward function to set the desired path toward the destination. A new reward function that consists of a buffer occupancy, link reliability and hop count is considered. Moreover, look ahead algorithm is employed to update the Q-value with values within two hops simultaneously. This approach leads to a decision of the optimal next hop by taking congestion status in two hops into account, accordingly. Finally, the simulation results presented approximately 20% higher packet delivery ratio and 15% shorter end-to-end delay, compared to those with the existing scheme by avoiding congestion adaptively.
引用
收藏
页码:3671 / 3692
页数:22
相关论文
共 50 条
  • [1] Congestion Prevention Mechanism Based on Q-learning for Efficient Routing in SDN
    Kim, Seonhyeok
    Son, Jaehyeok
    Talukder, Ashis
    Hong, Choong Seon
    2016 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2016, : 124 - 128
  • [2] Double Q-learning based routing protocol for opportunistic networks
    Singh, Jagdeep
    Dhurandher, Sanjay Kumar
    Woungang, Isaac
    Barolli, Leonard
    JOURNAL OF HIGH SPEED NETWORKS, 2023, 29 (01) : 1 - 14
  • [3] Q-FANET: Improved Q-learning based routing protocol for FANETs
    da Costa, Luis Antonio L. F.
    Kunst, Rafael
    de Freitas, Edison Pignaton
    COMPUTER NETWORKS, 2021, 198
  • [4] Q-learning based energy-efficient and void avoidance routing protocol for underwater acoustic sensor networks
    Khan, Zahoor Ali
    Karim, Obaida Abdul
    Abbas, Shahid
    Javaid, Nadeem
    Bin Zikria, Yousaf
    Tariq, Usman
    COMPUTER NETWORKS, 2021, 197
  • [5] QGrid: Q-learning Based Routing Protocol for Vehicular Ad Hoc Networks
    Li, Ruiling
    Li, Fan
    Li, Xin
    Wang, Yu
    2014 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2014,
  • [6] Q-learning based Stepwise Routing Protocol for Multi-UAV Networks
    Lim, Jae Won
    Ko, Young-Bae
    3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 307 - 309
  • [7] Q-Learning Based Routing in Optical Networks
    Bryant, Nolen B.
    Chung, Kwok K.
    Feng, Jie
    Harris, Sommer
    Umeh, Kristine N.
    Aibin, Michal
    2022 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2022, : 419 - 422
  • [8] Q-learning Based Delay Sensitive Routing Protocol for Maritime Search and Rescue Networks
    Wang, Zhen
    Lin, Bin
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [9] QLAR: A Q-Learning based Adaptive Routing for MANETs
    Serhani, Abdellatif
    Naja, Najib
    Jamali, Abdellah
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [10] Q-learning for adaptive, load based routing.
    Nowe, A
    Steenhaut, K
    Fakir, M
    Verbeeck, K
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 3965 - 3970