Automating the Configuration of MapReduce: A Reinforcement Learning Scheme

Cited by: 8
Authors
Mu, Ting-Yu [1]
Al-Fuqaha, Ala [1, 2]
Salah, Khaled [3]
Affiliations
[1] Western Michigan Univ, Comp Sci Dept, Kalamazoo, MI 49008 USA
[2] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
[3] Khalifa Univ Sci & Technol, Elect & Comp Engn Dept, Abu Dhabi, U Arab Emirates
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020 / Vol. 50 / Issue 11
Keywords
Deep learning; deep Q-network (DQN); machine learning; MapReduce; neural networks; reinforcement learning (RL); self-configuration
DOI
10.1109/TSMC.2019.2951789
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the exponential growth of data and the high demand for the analysis of large datasets, the MapReduce framework has been widely used to process data in a timely, cost-effective manner. It is well known that the performance of MapReduce is limited by its default configuration parameters, and a few research studies have focused on finding the optimal configurations to improve the performance of the MapReduce framework. Recently, machine learning-based approaches have received growing attention as a way to auto-configure the MapReduce parameters to account for the dynamic nature of the applications. In this article, we propose and develop a reinforcement learning (RL)-based scheme, named RL-MRCONF, to automatically configure the MapReduce parameters. Specifically, we explore and experiment with two variations of RL-MRCONF: one is based on the traditional RL algorithm and the other on the deep RL algorithm. Results obtained from simulations show that RL-MRCONF can successfully and effectively auto-configure the MapReduce parameters dynamically according to changes in job types and computing resources. Moreover, simulation results show our proposed RL-MRCONF scheme outperforms the traditional RL-based implementation. Using datasets provided by MR-Perf, simulation results show that our proposed scheme provides around 50% performance improvement in terms of execution time when compared with MapReduce using default settings.
Pages: 4183 - 4196
Page count: 14
Related papers
50 in total
  • [1] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [2] Automating Multi-element Subspace Exploration via Reinforcement Learning
    Sun, Yi
    Liu, YinXiao
    Wang, ZhongYao
    Niu, BaoLong
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 99 - 103
  • [3] A Reinforcement Learning Methodology for The Search of SRAM CIM-based Accelerator Configuration
    Lai, Bo-Xi
    Huang, Shih-Hsu
    Kao, Hsu-Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 141 - 142
  • [4] Setup and Configuration of MapReduce in a Cloud Environment
    Blaisse, Adam Pasqua
    Berlove, Martin
    Wu, Jie
    2014 IEEE 11TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2014, : 773 - 774
  • [5] Privacy preserving scheme for MapReduce
    Shetty, Madhvaraj M.
    Manjaiah, D. H.
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [6] Optimization of configuration of corrugated airfoil using deep reinforcement learning and transfer learning
    Noda, T.
    Okabayashi, K.
    Kimura, S.
    Takeuchi, S.
    Kajishima, T.
    AIP ADVANCES, 2023, 13 (03)
  • [7] Prioritized Environment Configuration for Drone Control with Deep Reinforcement Learning
    Jang, Sooyoung
    Choi, Changbeom
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
  • [8] Reinforcement learning based energy efficient resource allocation strategy of MapReduce jobs with deadline constraint
    Lingam, Greeshma
    Cluster Computing, 2023, 26 : 2719 - 2735
  • [9] Reinforcement learning based energy efficient resource allocation strategy of MapReduce jobs with deadline constraint
    Lingam, Greeshma
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2719 - 2735
  • [10] An Optimization Scheme in MapReduce for Reduce Stage
    Liu, Qi
    Cai, Weidong
    Wang, Baowei
    Fu, Zhangjie
    Linge, Nigel
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (08): : 197 - 207