Q-learning for adaptive, load based routing.

被引:0
|
作者
Nowe, A [1 ]
Steenhaut, K [1 ]
Fakir, M [1 ]
Verbeeck, K [1 ]
机构
[1] Free Univ Brussels, Erasmushgsk Brussel, TW, INFO, B-1050 Brussels, Belgium
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The results reported in this paper concern the control problem of routing in packet switched internets. Using Q-learning an adaptive, distributed and autonomous routing strategy can be obtained. The objective of the Q-learner under study;is to balance the load such that average packet delivery time is optimised. If pure Q-learning is applied to routing each source has to learn the expected cost for sending a message via each of its neighbours for all destinations. Since Q-learning is basically a trial and error method packets have to be sent along non-optimal paths, which artificially increases the load. To reduce this effect and to speed up the learning a variant of Q-learning has been developed. Exploration and exploitation are partially decoupled such that stabilising features can be included in the Q-learning algorithm, to cope with instabilities and overhead that might be caused by the costly exploitation in search of alternative paths. Tn the paper the above statements will be justified mathematically and supported mathematically experiments.
引用
收藏
页码:3965 / 3970
页数:6
相关论文
共 50 条
  • [1] QLAR: A Q-Learning based Adaptive Routing for MANETs
    Serhani, Abdellatif
    Naja, Najib
    Jamali, Abdellah
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [2] Q-Learning Based Routing in Optical Networks
    Bryant, Nolen B.
    Chung, Kwok K.
    Feng, Jie
    Harris, Sommer
    Umeh, Kristine N.
    Aibin, Michal
    2022 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2022, : 419 - 422
  • [3] Q-LBR: Q-Learning Based Load Balancing Routing for UAV-Assisted VANET
    Roh, Bong-Soo
    Han, Myoung-Hun
    Ham, Jae-Hyun
    Kim, Ki-Il
    SENSORS, 2020, 20 (19) : 1 - 17
  • [4] Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
    Zhou, Jian
    Gong, Xiaotian
    Sun, Lijuan
    Xie, Yong
    Yan, Xiaoyong
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [5] Q-Learning Based Routing Protocol for Congestion Avoidance
    Godfrey, Daniel
    Kim, Beom-Su
    Miao, Haoran
    Shah, Babar
    Hayat, Bashir
    Khan, Imran
    Sung, Tae-Eung
    Kim, Ki-Il
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3671 - 3692
  • [6] Adaptive UAV-Assisted Geographic Routing With Q-Learning in VANET
    Jiang, Shanshan
    Huang, Zhitong
    Ji, Yuefeng
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (04) : 1358 - 1362
  • [7] Q-learning based Adaptive Subgraph Generation Algorithm for Graph Routing in ISA 100.11a
    Khan, Faraz Idris
    Kim, Ki-Hyung
    2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 1099 - 1101
  • [8] Adaptive Bases for Q-learning
    Di Castro, Dotan
    Mannor, Shie
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4587 - 4593
  • [9] Adaptive Learning Recommendation Strategy Based on Deep Q-learning
    Tan, Chunxi
    Han, Ruijian
    Ye, Rougang
    Chen, Kani
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 251 - 266
  • [10] A Dynamic Planning Algorithm based on Q-Learning Routing in SDON
    Shang, Jingkun
    Li, Hui
    Man, Xiangkun
    Wu, Fang
    Zhao, Jia Wei
    Ma, Xiaomei
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,