Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources

被引:2
作者
John, Indu [1 ]
Bhatnagar, Shalabh [1 ]
机构
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore, Karnataka, India
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
reinforcement learning; deep learning; cloud computing; resource allocation; atari games;
D O I
10.1109/ijcnn48605.2020.9206598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new deep reinforcement learning algorithm using the technique of successive over-relaxation (SOR) in Deep Q-networks (DQNs). The new algorithm, named SOR-DQN, uses modified targets in the DQN framework with the aim of accelerating training. This work is motivated by the problem of auto-scaling resources for cloud applications, for which existing algorithms suffer from issues such as slow convergence, poor performance during the training phase and non-scalability. For the above problem, SOR-DQN achieves significant improvements over DQN on both synthetic and real datasets. We also study the generalization ability of the algorithm to multiple tasks by using it to train agents playing Atari video games.
引用
收藏
页数:6
相关论文
共 21 条
  • [1] [Anonymous], 2016, Asynchronous methods for deep reinforcement learning
  • [2] [Anonymous], 2015, P INT C LEARN REPR I
  • [3] Arlitt M.F., 1996, PROC ACM SIGMETRICS, P126
  • [4] Azar M. G., 2011, SPEEDY Q LEARNING
  • [5] The Arcade Learning Environment: An Evaluation Platform for General Agents
    Bellemare, Marc G.
    Naddaf, Yavar
    Veness, Joel
    Bowling, Michael
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 253 - 279
  • [6] Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
  • [7] DERP: A Deep Reinforcement Learning Cloud System for Elastic Resource Provisioning
    Bitsakos, Constantinos
    Konstantinou, Ioannis
    Koziris, Nectarios
    [J]. 2018 16TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2018), 2018, : 21 - 29
  • [8] CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms
    Calheiros, Rodrigo N.
    Ranjan, Rajiv
    Beloglazov, Anton
    De Rose, Cesar A. F.
    Buyya, Rajkumar
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (01) : 23 - 50
  • [9] Du Yunshu, 2017, ARXIV170904083
  • [10] Dutreilh X, 2011, PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTONOMIC AND AUTONOMOUS SYSTEMS (ICAS 2011), P67