Hierarchical Reinforcement Learning and Parallel Computing Applied to the K-Server Problem

被引：5

作者：

Costa, M. L. ^{[1
]}

Padilha, C. A. A. ^{[3
]}

Melo, J. D. ^{[2
]}

Doria Neto, A. D. ^{[2
]}

机构：

[1] Univ Estado Rio Grande Norte UERN, Mossoro, RN, Brazil

[2] Univ Estado Rio Grande Norte UERN, Natal, RN, Brazil

[3] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil

来源：

IEEE LATIN AMERICA TRANSACTIONS | 2016年 / 14卷 / 10期

关键词：

Metrical Task Systems; The K-Server Problem; Curse of Dimensionality; Hierarchical Reinforcement Learning; Q-Learning Algorithm; Parallel Computing;

D O I：

10.1109/TLA.2016.7786315

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper was proposed an algorithm based on Hierarchical Reinforcement Learning (HRL) and Parallel Computing to solve an online computing problem, the K-Server Problem (KSP). The size of the storage structure used for reinforcement learning to obtain the optimal policy grows exponentially with the number of states and actions, limiting its use to smaller problems due to the curse of dimensionality. The problem is modeled as a multiple steps decision process computed in parallel by applying the Q-learning algorithm to obtain optimal policies in a reduced number of nodes obtained from an clustering process. The results show the applicability of the proposed method to real problems of large size.

引用

页码：4351 / 4357

页数：7

共 8 条

[1] Deep reinforcement learning applied to the k-server problem
Sousa Lins, Ramon Augusto
Neto Doria, Adriao Duarte
de Melo, Jorge Dantas
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 135 : 212 - 218
[2] Randomized K-Server on Hierarchical Binary Trees
Cote, Aaron
Meyerson, Adam
Poplawski, Laura
STOC'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL SYMPOSIUM ON THEORY OF COMPUTING, 2008, : 227 - +
[3] On the competitive ratio of the work function algorithm for the k-server problem
Bartal, Y
Koutsoupias, E
THEORETICAL COMPUTER SCIENCE, 2004, 324 (2-3) : 337 - 345
[4] Work function algorithm with a moving window for solving the on-line k-server problem
Baumgartner, Alfonzo
Manger, Robert
Hocenski, Zeljko
PROCEEDINGS OF THE ITI 2007 29TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2007, : 591 - +
[5] A network flow implementation of a modified work function algorithm for solving the k-server problem
Baumgartner, Alfonzo
Manger, Robert
Hocenski, Zeljko
SOR'07: PROCEEDINGS OF THE 9TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH IN SLOVENIA, 2007, : 83 - +
[6] Hierarchical reinforcement learning as creative problem solving
Colin, Thomas R.
Belpaeme, Tony
Cangelosi, Angelo
Hemion, Nikolas
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 86 : 196 - 206
[7] Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning
Fu, Weiqi
Xu, Lianming
Wu, Xin
Wang, Li
Fei, Aiguo
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[8] An End-to-end Hierarchical Reinforcement Learning Framework for Large-scale Dynamic Flexible Job-shop Scheduling Problem
Lei, Kun
Guo, Peng
Wang, Yi
Xiong, Jianyu
Zhao, Wenchao
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,

← 1 →