Automatic Refinement of Parallel Applications Structure Detection

Cited by: 9
Authors
Gonzalez, Juan [1]
Huck, Kevin [2]
Gimenez, Judit [1]
Labarta, Jesus [1]
Affiliations
[1] Univ Politecn Catalunya Barcelona Tech, Barcelona Supercomp Ctr, Barcelona, Spain
[2] ParaTools Inc, Eugene, OR USA
Source
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) | 2012
Keywords
parallel applications; performance analysis; automatic analysis; cluster analysis; data mining
DOI
10.1109/IPDPSW.2012.209
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Analyzing parallel programs has become increasingly difficult due to the immense amount of information collected on large systems. In this scenario, cluster analysis has proven to be a useful technique for reducing the amount of data to analyze. A good example is the use of the density-based clustering algorithm DBSCAN to identify similar single program multiple data (SPMD) computing phases in message-passing applications. This structure detection simplifies the analyst's work, as all the available information is reduced to a small set of clusters. However, DBSCAN presents two major problems: it is very sensitive to its parametrization, and it cannot correctly detect clusters when the data set has different densities across the data space. In this paper, we introduce Aggregative Cluster Refinement, an iterative algorithm that produces more accurate structure detections of SPMD phases than DBSCAN. In addition, it is able to detect clusters with different densities.
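The two DBSCAN problems named in the abstract can be illustrated with a small sketch. This is a textbook-style DBSCAN written in plain Python, not the authors' implementation and not Aggregative Cluster Refinement; the point coordinates and the `eps` and `min_pts` values are assumptions chosen only for illustration. Two dense groups sit close together and a third group is sparse, so no single `eps` recovers all three.

```python
import math

NOISE = -1

def dbscan(points, eps, min_pts):
    """Textbook DBSCAN: returns one label per point, NOISE (-1) for outliers."""
    labels = [None] * len(points)

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE          # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster    # border point: joins, but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reach = neighbors(j)
            if len(reach) >= min_pts:  # core point: keep expanding the cluster
                queue.extend(reach)
    return labels

# Hypothetical data: two dense groups (spacing 1, gap 5) and one sparse group (spacing 10).
dense_a = [(float(x), 0.0) for x in range(0, 5)]
dense_b = [(float(x), 0.0) for x in range(9, 14)]
sparse_c = [(float(x), 0.0) for x in range(100, 150, 10)]
points = dense_a + dense_b + sparse_c

tight = dbscan(points, eps=1.5, min_pts=3)   # separates A and B, loses C as noise
loose = dbscan(points, eps=11.0, min_pts=3)  # finds C, but merges A and B

print(tight)  # [0, 0, 0, 0, 0, 1, 1, 1, 1, 1, -1, -1, -1, -1, -1]
print(loose)  # [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
```

With `min_pts=3`, the sparse group needs `eps` of at least 10 to form a cluster, while the two dense groups merge as soon as `eps` reaches 5, so every choice of `eps` misses at least one of the three groups in this toy data set.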
Pages: 1680-1687
Number of pages: 8