FiM: Performance Prediction for Parallel Computation in Iterative Data Processing Applications

被引：29

作者：

Bhimani, Janki ^{[1
]}

Mi, Ningfang ^{[1
]}

Leeser, Miriam ^{[1
]}

Yang, Zhengyu ^{[1
]}

机构：

[1] Northeastern Univ, Dept Elect & Comp Engn, 360 Huntington Ave, Boston, MA 02115 USA

来源：

2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD) | 2017年

基金：

美国国家科学基金会;

关键词：

Performance Modeling; Markov Model; Regression; Distributed Systems; Cloud Computing; Big Data Infrastructure; MODEL;

D O I：

10.1109/CLOUD.2017.53

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Predicting performance of an application running on high performance computing (HPC) platforms in a cloud environment is increasingly becoming important because of its influence on development time and resource management. However, predicting the performance with respect to parallel processes is complex for iterative, multi-stage applications. This research proposes a performance approximation approach FiM to model the computing performance of iterative, multi-stage applications running on a master-compute framework. FiM consists of two key components that are coupled with each other: 1) Stochastic Markov Model to capture non-deterministic runtime that often depends on parallel resources, e.g., number of processes. 2) Machine Learning Model that extrapolates the parameters for calibrating our Markov model when we have changes in application parameters such as dataset. Our new modeling approach considers different design choices along multiple dimensions, namely (i) process level parallelism, (ii) distribution of cores on multi-core processors in cloud computing, (iii) application related parameters, and (iv) characteristics of datasets. The major contribution of our prediction approach is that FiM is able to provide an accurate prediction of parallel computation time for the datasets which have much larger size than that of the training datasets. Such calculation prediction provides data analysts a useful insight of optimal configuration of parallel resources (e.g., number of processes and number of cores) and also helps system designers to investigate the impact of changes in application parameters on system performance.

引用

页码：359 / 366

页数：8

共 17 条

[1]

Alexandrov A., 1995, LOGGP INCORPORATING

[2]

[Anonymous], 2015, DISCOVERY CLUSTER OV

[3]

[Anonymous], 2006, Journal of the Royal Statistical Society, Series B

[4] SimpleScalar: An infrastructure for computer system modeling [J].

Austin, T ;

Larson, E ;

Ernst, D .

COMPUTER, 2002, 35 (02) :59-+

[5]

Barnes BJ, 2008, ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, P368

[6]

Bhimani J., 2015, HIGH PERFORMANCE EXT, P1

[7]

Bhimani J., 2016, 35 IEEE INT PERF COM

[8] LogGPO: An accurate communication model for performance prediction of MPI programs [J].

Chen WenGuang ;

Zhai JiDong ;

Zhang Jin ;

Zheng WeiMin .

SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (10) :1785-1791

[9]

Chen XE, 2009, INT S HIGH PERF COMP, P329, DOI 10.1109/HPCA.2009.4798270

[10]

de Melo A. C., 2010, SLIDES LINUX K

← 1 2 →