Validating the simulation of large-scale parallel applications using statistical characteristics

被引:1
作者
Zhang D. [1 ]
Wilke J. [2 ]
Hendry G. [2 ]
Dechev D. [1 ]
机构
[1] Department of Computer Science, University of Central Florida, 211 Harris Center (Building 116), 4000 Central Florida Boulevard, Orlando, 32816, FL
[2] Sandia National Laboratories, California, P.O. Box 969, Livermore, 94551-0969, CA
关键词
Evaluation metrics; Simulation evaluation; Software skeleton;
D O I
10.1145/2809778
中图分类号
学科分类号
摘要
Simulation is a widely adopted method to analyze and predict the performance of large-scale parallel applications. Validating the hardware model is highly important for complex simulations with a large number of parameters. Common practice involves calculating the percent error between the projected and the real execution time of a benchmark program. However, in a high-dimensional parameter space, this coarse-grained approach often suffers from parameter insensitivity, which may not be known a priori. Moreover, the traditional approach cannot be applied to the validation of software models, such as application skeletons used in online simulations. In this work, we present a methodology and a toolset for validating both hardware and software models by quantitatively comparing fine-grained statistical characteristics obtained from execution traces. Although statistical information has been used in tasks like performance optimization, this is the first attempt to apply it to simulation validation. Our experimental results show that the proposed evaluation approach offers significant improvement in fidelity when compared to evaluation using total execution time, and the proposed metrics serve as reliable criteria that progress toward automating the simulation tuning process. © 2016 ACM.
引用
收藏
相关论文
共 35 条
[21]  
Song H.J., Liu X., Jakobsen D., Bhagwan R., Zhang X., Taura K., Chien A., The microgrid: A scientific tool for modeling computational grids, Proceedings of the ACM/IEEE 2000 Conference on Supercomputing. IEEE, (2000)
[22]  
Sottile M., Dakshinamurthy A., Hendry G., Dechev D., Automatic extraction of software skeletons for benchmarking large-scale parallel applications, ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (PADS, (2013)
[23]  
Sudheer C.D., Srinivasan A., Optimization of the hop-byte metric for effective topology aware mapping, Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC, pp. 1-9, (2012)
[24]  
Susukita R., Ando H., Aoyagi M., Honda H., Inadomi Y., Inoue K., Ishizuki S., Kimura Y., Komatsu H., Kurokawa M., Performance prediction of large-scale parallell system and application using macro-level simulation, Proceedings of the 2008 ACM/IEEE Conference on Supercomputing. IEEE, 20, (2008)
[25]  
Tikir M., Laurenzano M., Carrington L., Snavely A., PSINS: An open source event tracer and execution simulator for MPI applications, Euro-Par 2009 Parallel Processing, pp. 135-148, (2009)
[26]  
Underwood K.D., Levenhagen M., Rodrigues A., Simulating red storm: Challenges and successes in building a system simulation, IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), pp. 1-10, (2007)
[27]  
Velho P., Legrand A., Accuracy study and improvement of network simulation in the SimGrid framework, Proceedings of the 2nd International Conference on Simulation Tools and Techniques. ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 13, (2009)
[28]  
Vetter J.S., McCracken M.O., Statistical scalability analysis of communication operations in distributed applications, ACM SIGPLAN Notices, 36, pp. 123-132, (2001)
[29]  
Weber M., Brendel R., Brunst H., Trace file comparison with a hierarchical sequence alignment algorithm, Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA). IEEE, pp. 247-254, (2012)
[30]  
Wilke J.J., Sargsyan K., Kenny J.P., Debusschere B., Najm H.N., Hendry G., Validation and uncertainty assessment of extreme-scale HPC simulation through Bayesian inference, Euro-Par 2013 Parallel Processing, pp. 41-52, (2013)