Automating the Configuration of MapReduce: A Reinforcement Learning Scheme

被引：8

作者：

Mu, Ting-Yu ^{[1
]}

Al-Fuqaha, Ala ^{[1
,2
]}

Salah, Khaled ^{[3
]}

机构：

[1] Western Michigan Univ, Comp Sci Dept, Kalamazoo, MI 49008 USA

[2] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar

[3] Khalifa Univ Sci & Technol, Elect & Comp Engn Dept, Abu Dhabi, U Arab Emirates

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020年 / 50卷 / 11期

关键词：

Deep learning; deep Q-network (DQN); machine learning; MapReduce; neural networks; reinforcement learning (RL); self-configuration;

D O I：

10.1109/TSMC.2019.2951789

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the exponential growth of data and the high demand for the analysis of large datasets, the MapReduce framework has been widely utilized to process data in a timely, cost-effective manner. It is well-known that the performance of MapReduce is limited by its default configuration parameters, and there are a few research studies that have focused on finding the optimal configurations to improve the performance of the MapReduce framework. Recently, machine learning based approaches have been receiving more attention to be utilized to auto configure the MapReduce parameters to account for the dynamic nature of the applications. In this article, we propose and develop a reinforcement learning (RL)-based scheme, named RL-MRCONF, to automatically configure the MapReduce parameters. Specifically, we explore and experiment with two variations of RL-MRCONF; one variation is based on the traditional RL algorithm and the second is based on the deep RL algorithm. Results obtained from simulations show that the RL-MRCONF has the ability to successfully and effectively auto-configure the MapReduce parameters dynamically according to changes in job types and computing resources. Moreover, simulation results show our proposed RL-MRCONF scheme outperforms the traditional RL-based implementation. Using datasets provided by MR-Perf, simulation results show that our proposed scheme provides around 50% performance improvement in terms of execution time when compared with MapReduce using default settings.

引用

页码：4183 / 4196

页数：14

共 50 条

[1] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
Agostinelli, Forest
Hocquet, Guillaume
Singh, Sameer
Baldi, Pierre
BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
[2] Automating Multi-element Subspace Exploration via Reinforcement Learning
Sun, Yi
Liu, YinXiao
Wang, ZhongYao
Niu, BaoLong
2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 99 - 103
[3] A Reinforcement Learning Methodology for The Search of SRAM CIM-based Accelerator Configuration
Lai, Bo-Xi
Huang, Shih-Hsu
Kao, Hsu-Yu
2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 141 - 142
[4] Setup and Configuration of MapReduce in a Cloud Environment
Blaisse, Adam Pasqua
Berlove, Martin
Wu, Jie
2014 IEEE 11TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2014, : 773 - 774
[5] Privacy preserving scheme for MapReduce
Shetty, Madhvaraj M.
Manjaiah, D. H.
PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
[6] Optimization of configuration of corrugated airfoil using deep reinforcement learning and transfer learning
Noda, T.
Okabayashi, K.
Kimura, S.
Takeuchi, S.
Kajishima, T.
AIP ADVANCES, 2023, 13 (03)
[7] Prioritized Environment Configuration for Drone Control with Deep Reinforcement Learning
Jang, Sooyoung
Choi, Changbeom
HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
[8] Reinforcement learning based energy efficient resource allocation strategy of MapReduce jobs with deadline constraint
Greeshma Lingam
Cluster Computing, 2023, 26 : 2719 - 2735
[9] Reinforcement learning based energy efficient resource allocation strategy of MapReduce jobs with deadline constraint
Lingam, Greeshma
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2719 - 2735
[10] An Optimization Scheme in MapReduce for Reduce Stage
Liu, Qi
Cai, Weidong
Wang, Baowei
Fu, Zhangjie
Linge, Nigel
INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (08): : 197 - 207

← 1 2 3 4 5 →