Validating the simulation of large-scale parallel applications using statistical characteristics

Cited by: 1
Authors
Zhang D. [1]
Wilke J. [2]
Hendry G. [2]
Dechev D. [1]
Affiliations
[1] Department of Computer Science, University of Central Florida, 211 Harris Center (Building 116), 4000 Central Florida Boulevard, Orlando, 32816, FL
[2] Sandia National Laboratories, California, P.O. Box 969, Livermore, 94551-0969, CA
Keywords
Evaluation metrics; Simulation evaluation; Software skeleton
DOI
10.1145/2809778
Abstract
Simulation is a widely adopted method to analyze and predict the performance of large-scale parallel applications. Validating the hardware model is highly important for complex simulations with a large number of parameters. Common practice involves calculating the percent error between the projected and the real execution time of a benchmark program. However, in a high-dimensional parameter space, this coarse-grained approach often suffers from parameter insensitivity, which may not be known a priori. Moreover, the traditional approach cannot be applied to the validation of software models, such as application skeletons used in online simulations. In this work, we present a methodology and a toolset for validating both hardware and software models by quantitatively comparing fine-grained statistical characteristics obtained from execution traces. Although statistical information has been used in tasks like performance optimization, this is the first attempt to apply it to simulation validation. Our experimental results show that the proposed evaluation approach offers significant improvement in fidelity when compared to evaluation using total execution time, and the proposed metrics serve as reliable criteria that progress toward automating the simulation tuning process. © 2016 ACM.
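The contrast the abstract draws between total-time percent error and fine-grained trace statistics can be illustrated with a small sketch. The record does not say which statistics the authors compare, so the two-sample Kolmogorov-Smirnov statistic below is an assumption chosen purely for illustration, and the per-event durations are made-up toy values, not data from the paper.

```python
from bisect import bisect_right


def percent_error(predicted_total, measured_total):
    """Coarse-grained check: percent error of the total execution time."""
    return abs(predicted_total - measured_total) / measured_total * 100.0


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between
    the empirical CDFs of the two samples (0 = identical, 1 = disjoint).
    Illustrative stand-in only; not necessarily the paper's metric."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    max_gap = 0.0
    for x in sorted(set(a) | set(b)):
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        max_gap = max(max_gap, abs(cdf_a - cdf_b))
    return max_gap


# Hypothetical per-event durations (seconds) taken from two traces.
measured = [1.0, 1.0, 1.0, 1.0, 6.0]   # real run: four short events, one long
simulated = [2.0, 2.0, 2.0, 2.0, 2.0]  # simulation: five uniform events

total_error = percent_error(sum(simulated), sum(measured))
distribution_gap = ks_statistic(measured, simulated)

print(f"percent error of total time: {total_error:.1f}%")
print(f"KS statistic on event durations: {distribution_gap:.2f}")
```

In this toy case the totals match exactly, so the coarse-grained check reports no error, while the per-event distributions differ substantially. That is the kind of mismatch a total-time comparison cannot see.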
Related papers
35 in total
[1] Albing C., Troullier N., Whalen S., Olson R., Glenski J., Pritchard H., Mills H., Scalable node allocation for improved performance in regular and anisotropic 3D torus supercomputers, Recent Advances in the Message Passing Interface, pp. 61-70, (2011)
[2] Clauss P.N., Stillwell M., Genaud S., Suter F., Casanova H., Quinson M., Single node on-line simulation of MPI applications with SMPI, 2011 IEEE International Parallel & Distributed Processing Symposium (IPDPS), pp. 664-675, (2011)
[3] Denzel W.E., Li J., Walker P., Jin Y., A framework for end-to-end simulation of high-performance computing systems, Simulation, 86, 5-6, pp. 331-350, (2010)
[4] Desprez F., Markomanolis G.S., Quinson M., Suter F., Assessing the performance of MPI applications through time-independent trace replay, Proceedings of the 2011 40th International Conference on Parallel Processing Workshops (ICPPW), pp. 467-476, (2011)
[5] Geimer M., Wolf F., Wylie B.J.N., Abraham E., Becker D., Mohr B., The Scalasca performance toolset architecture, Concurrency and Computation: Practice and Experience, 22, 6, pp. 702-719, (2010)
[6] Hammond S.D., Mudalige G.R., Smith J.A., Jarvis S.A., Herdman J.A., Vadgama A., WARPP: A toolkit for simulating high-performance parallel scientific codes, Proceedings of the 2nd International Conference on Simulation Tools and Techniques. ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 19, (2009)
[7] Hoefler T., Schneider T., Lumsdaine A., LogGOPSim-simulating large-scale applications in the LogGOPS model, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC), 10, pp. 597-604, (2010)
[8] Huck K.A., Malony A.D., Shende S., Jacobsen D.W., Integrated measurement for cross-platform OpenMP performance analysis, Using and Improving OpenMP for Devices, Tasks, and More, pp. 146-160, (2014)
[9] Janssen C.L., Adalsteinsson H., Cranford S., Kenny J.P., Pinar A., Evensky D.A., Mayo J., A simulator for large-scale parallel computer architectures, International Journal of Distributed Systems and Technologies, 1, 2, pp. 57-73, (2010)
[10] Janssen C., Quinlan D., Shalf J., Architectural Simulation for Exascale Hardware/Software Co-Design, (2011)