Predicting Hadoop misconfigurations using machine learning

被引:1
作者
Robert, Andrew [1 ]
Gupta, Apaar [1 ]
Shenoy, Vinayak [1 ]
Sitaram, Dinkar [1 ]
Kalambur, Subramaniam [1 ]
机构
[1] PES Univ, Ctr Cloud Comp & Big Data, Dept Comp Sci & Engn, Bangalore, Karnataka, India
关键词
big data; Hadoop; machine learning; misconfiguration; prediction;
D O I
10.1002/spe.2790
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Distributed applications are popular for heavy workloads where the resources of a single machine are not sufficient. These distributed applications come with many parameters to tune so that cluster resources can be effectively utilized. However, any misconfiguration of the available parameters may result in suboptimal performance of one or more machines in the cluster. These events may go unnoticed or can result in crashes. This problem of misconfigured parameters has no straightforward solution due to the variety of parameters and vastly different workloads being processed. In this article, we propose a methodology for machine learning-based detection of misconfigurations. We collect data mined from system resource utilization, Hadoop logs, and job-level metrics to train a model using decision tree and support vector machine. The models are used to identify whether a set of configuration parameters could result in a crash or a slowdown for a specific workload. The approach explained in this article can be extended to other distributed big data applications, such as Spark, Hive, Pig, and so on.
引用
收藏
页码:1168 / 1183
页数:16
相关论文
共 48 条
[1]  
Al-Shaer E., 2010, P 3 ACM WORKSHOP ASS, P37, DOI DOI 10.1145/1866898.1866905
[2]  
[Anonymous], 2012, 9 USENIX S NETWORKED
[3]  
[Anonymous], 2 7 3 YARN SIT XML
[4]  
[Anonymous], 2010, HotCloud
[5]  
[Anonymous], P 2007 WORKSH TACKL
[6]  
Attariyan M., 2010, 9 USENIX S OP SYST D
[7]  
Cloudera, 2014, SOM OTH CRUC PAR
[8]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[9]   Execution Anomaly Detection in Distributed Systems through Unstructured Log Analysis [J].
Fu, Qiang ;
Lou, Jian-Guang ;
Wang, Yi ;
Li, Jiang .
2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, :149-+
[10]  
Hadoop, 2018, YARN DEF XML PAR