Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources

被引：2

作者：

John, Indu ^{[1
]}

Bhatnagar, Shalabh ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore, Karnataka, India

来源：

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年

关键词：

reinforcement learning; deep learning; cloud computing; resource allocation; atari games;

D O I：

10.1109/ijcnn48605.2020.9206598

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new deep reinforcement learning algorithm using the technique of successive over-relaxation (SOR) in Deep Q-networks (DQNs). The new algorithm, named SOR-DQN, uses modified targets in the DQN framework with the aim of accelerating training. This work is motivated by the problem of auto-scaling resources for cloud applications, for which existing algorithms suffer from issues such as slow convergence, poor performance during the training phase and non-scalability. For the above problem, SOR-DQN achieves significant improvements over DQN on both synthetic and real datasets. We also study the generalization ability of the algorithm to multiple tasks by using it to train agents playing Atari video games.

引用

页数：6

共 21 条

[1] [Anonymous], 2016, Asynchronous methods for deep reinforcement learning
[2] [Anonymous], 2015, P INT C LEARN REPR I
[3] Arlitt M.F., 1996, PROC ACM SIGMETRICS, P126
[4] Azar M. G., 2011, SPEEDY Q LEARNING
[5] The Arcade Learning Environment: An Evaluation Platform for General Agents
Bellemare, Marc G.
Naddaf, Yavar
Veness, Joel
Bowling, Michael
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 253 - 279
[6] Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
[7] DERP: A Deep Reinforcement Learning Cloud System for Elastic Resource Provisioning
Bitsakos, Constantinos
Konstantinou, Ioannis
Koziris, Nectarios
[J]. 2018 16TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2018), 2018, : 21 - 29
[8] CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms
Calheiros, Rodrigo N.
Ranjan, Rajiv
Beloglazov, Anton
De Rose, Cesar A. F.
Buyya, Rajkumar
[J]. SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (01) : 23 - 50
[9] Du Yunshu, 2017, ARXIV170904083
[10] Dutreilh X, 2011, PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTONOMIC AND AUTONOMOUS SYSTEMS (ICAS 2011), P67

← 1 2 3 →