Task failure resilience technique for improving the performance of MapReduce in Hadoop

被引：10

作者：

Kavitha, C. ^{[1
]}

Anita, X. ^{[2
]}

机构：

[1] Anna Univ, Dept Informat & Commun Engn, Chennai, Tamil Nadu, India

[2] Jerusalem Coll Engn, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India

来源：

ETRI JOURNAL | 2020年 / 42卷 / 05期

关键词：

Hadoop; in-memory; key-value pair; MapReduce; recovery; Redis cache; resilience; task failure;

D O I：

10.4218/etrij.2018-0265

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

MapReduce is a framework that can process huge datasets in parallel and distributed computing environments. However, a single machine failure during the runtime of MapReduce tasks can increase completion time by 50%. MapReduce handles task failures by restarting the failed task and re-computing all input data from scratch, regardless of how much data had already been processed. To solve this issue, we need the computed key-value pairs to persist in a storage system to avoid re-computing them during the restarting process. In this paper, the task failure resilience (TFR) technique is proposed, which allows the execution of a failed task to continue from the point it was interrupted without having to redo all the work. Amazon ElastiCache for Redis is used as a non-volatile cache for the key-value pairs. We measured the performance of TFR by running different Hadoop benchmarking suites. TFR was implemented using the Hadoop software framework, and the experimental results showed significant performance improvements when compared with the performance of the default Hadoop implementation.

引用

页码：751 / 763

页数：13

共 23 条

[1]

8K Miles, 2014, BILL MESS ART ARCH S

[2]

Acharya S, 2000, SIGMOD REC, V29, P487

[3]

Antunes F, 2011, INT CONF BUS ADMIN, P32

[4]

AWS, WHAT IS AM ELASTICAC

[5] MRSIM: Mitigating Reducer Skew In MapReduce [J].

Chen, Lei ;

Lu, Wei ;

Che, Xiaoping ;

Xing, Weiwei ;

Wang, Liqiang ;

Yang, Yong .

2017 31ST IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (IEEE WAINA 2017), 2017, :379-384

[6] aHDFS: An Erasure-Coded Data Archival System for Hadoop Clusters [J].

Chen, Yuanqi ;

Zhou, Yi ;

Taneja, Shubbhi ;

Qin, Xiao ;

Huang, Jianzhong .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (11) :3060-3073

[7]

Dean J., 2006, PACT, P1

[8]

Dittrich J, 2010, PROC VLDB ENDOW, V3, P518

[9] SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters [J].

Gu, Rong ;

Yang, Xiaoliang ;

Yan, Jinshuang ;

Sun, Yuanhao ;

Wang, Bing ;

Yuan, Chunfeng ;

Huang, Yihua .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (03) :2166-2179

[10]

Herodotou H., 2011, ARXIV11060940, P1

← 1 2 3 →