Hierarchical Reinforcement Learning and Parallel Computing Applied to the K-Server Problem

被引:5
作者
Costa, M. L. [1 ]
Padilha, C. A. A. [3 ]
Melo, J. D. [2 ]
Doria Neto, A. D. [2 ]
机构
[1] Univ Estado Rio Grande Norte UERN, Mossoro, RN, Brazil
[2] Univ Estado Rio Grande Norte UERN, Natal, RN, Brazil
[3] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
关键词
Metrical Task Systems; The K-Server Problem; Curse of Dimensionality; Hierarchical Reinforcement Learning; Q-Learning Algorithm; Parallel Computing;
D O I
10.1109/TLA.2016.7786315
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper was proposed an algorithm based on Hierarchical Reinforcement Learning (HRL) and Parallel Computing to solve an online computing problem, the K-Server Problem (KSP). The size of the storage structure used for reinforcement learning to obtain the optimal policy grows exponentially with the number of states and actions, limiting its use to smaller problems due to the curse of dimensionality. The problem is modeled as a multiple steps decision process computed in parallel by applying the Q-learning algorithm to obtain optimal policies in a reduced number of nodes obtained from an clustering process. The results show the applicability of the proposed method to real problems of large size.
引用
收藏
页码:4351 / 4357
页数:7
相关论文
共 8 条
  • [1] Deep reinforcement learning applied to the k-server problem
    Sousa Lins, Ramon Augusto
    Neto Doria, Adriao Duarte
    de Melo, Jorge Dantas
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 135 : 212 - 218
  • [2] Randomized K-Server on Hierarchical Binary Trees
    Cote, Aaron
    Meyerson, Adam
    Poplawski, Laura
    STOC'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL SYMPOSIUM ON THEORY OF COMPUTING, 2008, : 227 - +
  • [3] On the competitive ratio of the work function algorithm for the k-server problem
    Bartal, Y
    Koutsoupias, E
    THEORETICAL COMPUTER SCIENCE, 2004, 324 (2-3) : 337 - 345
  • [4] Work function algorithm with a moving window for solving the on-line k-server problem
    Baumgartner, Alfonzo
    Manger, Robert
    Hocenski, Zeljko
    PROCEEDINGS OF THE ITI 2007 29TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2007, : 591 - +
  • [5] A network flow implementation of a modified work function algorithm for solving the k-server problem
    Baumgartner, Alfonzo
    Manger, Robert
    Hocenski, Zeljko
    SOR'07: PROCEEDINGS OF THE 9TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH IN SLOVENIA, 2007, : 83 - +
  • [6] Hierarchical reinforcement learning as creative problem solving
    Colin, Thomas R.
    Belpaeme, Tony
    Cangelosi, Angelo
    Hemion, Nikolas
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 86 : 196 - 206
  • [7] Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning
    Fu, Weiqi
    Xu, Lianming
    Wu, Xin
    Wang, Li
    Fei, Aiguo
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [8] An End-to-end Hierarchical Reinforcement Learning Framework for Large-scale Dynamic Flexible Job-shop Scheduling Problem
    Lei, Kun
    Guo, Peng
    Wang, Yi
    Xiong, Jianyu
    Zhao, Wenchao
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,