Q-learning for adaptive, load based routing.

被引:0
|
作者
Nowe, A [1 ]
Steenhaut, K [1 ]
Fakir, M [1 ]
Verbeeck, K [1 ]
机构
[1] Free Univ Brussels, Erasmushgsk Brussel, TW, INFO, B-1050 Brussels, Belgium
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The results reported in this paper concern the control problem of routing in packet switched internets. Using Q-learning an adaptive, distributed and autonomous routing strategy can be obtained. The objective of the Q-learner under study;is to balance the load such that average packet delivery time is optimised. If pure Q-learning is applied to routing each source has to learn the expected cost for sending a message via each of its neighbours for all destinations. Since Q-learning is basically a trial and error method packets have to be sent along non-optimal paths, which artificially increases the load. To reduce this effect and to speed up the learning a variant of Q-learning has been developed. Exploration and exploitation are partially decoupled such that stabilising features can be included in the Q-learning algorithm, to cope with instabilities and overhead that might be caused by the costly exploitation in search of alternative paths. Tn the paper the above statements will be justified mathematically and supported mathematically experiments.
引用
收藏
页码:3965 / 3970
页数:6
相关论文
共 50 条
  • [21] Topology-Aware Resilient Routing Protocol for FANETs: An Adaptive Q-Learning Approach
    Cui, Yanpeng
    Zhang, Qixun
    Feng, Zhiyong
    Wei, Zhiqing
    Shi, Ce
    Yang, Heng
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) : 18632 - 18649
  • [22] Forecasting on Trading: A Parameter Adaptive Framework Based on Q-learning
    Chen, Chao
    Li, Yelin
    Bu, Hui
    Wu, Junjie
    Xiong, Zhang
    2018 15TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2018,
  • [23] Adaptive packet scheduling in IoT environment based on Q-learning
    Kim, Donghyun
    Lee, Taeho
    Kim, Sejun
    Lee, Byungjun
    Youn, Hee Yong
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (06) : 2225 - 2235
  • [24] Adaptive packet scheduling in IoT environment based on Q-learning
    Donghyun Kim
    Taeho Lee
    Sejun Kim
    Byungjun Lee
    Hee Yong Youn
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 2225 - 2235
  • [25] Q-LEARNING BASED CONTROL ALGORITHM FOR HTTP ADAPTIVE STREAMING
    Martin, Virginia
    Cabrera, Julian
    Garcia, Narciso
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [26] Goal evolution based on adaptive Q-learning for intelligent agent
    Kuo, Jong Yih
    Tsai, Ming Lan
    Hsueh, Nien Lin
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 434 - +
  • [27] Q-Learning Based Distributed Adaptive Algorithm for Topological Stability
    Huang Q.-D.
    Shi B.-Y.
    Guo M.-P.
    Yuan R.-Z.
    Chen C.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (02): : 262 - 268
  • [28] An Adaptive Routing Scheme Based on Q-learning and Real-time Traffic Monitoring for Network-on-Chip
    Fan, Renshi
    Du, Gaoming
    Xu, Pengfei
    Li, Zhenmin
    Song, Yukun
    Zhang, Duoli
    PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 244 - 248
  • [29] A Double Q-Learning Routing in Delay Tolerant Networks
    Yuan, Fan
    Wu, Jaogao
    Zhou, Hongyu
    Liu, Linfeng
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [30] A Tailored Q-Learning for Routing in Wireless Sensor Networks
    Sharma, Varun K.
    Shukla, Shiv Shankar Prasad
    Singh, Varun
    2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 663 - 668