A Survey of Graph Comparison Methods with Applications to Nondeterminism in High-Performance Computing

被引:1
作者
Bhowmick, Sanjukta [1 ]
Bell, Patrick [2 ]
Taufer, Michela [2 ]
机构
[1] Univ Northern Texas, Denton, TX USA
[2] Univ Tennessee, Knoxville Coll Engn, 401 Min H Kao Bldg,1520 Middle Dr, Knoxville, TN 37996 USA
基金
美国国家科学基金会;
关键词
Graph comparison; graph isomorphism; graph alignment; graph kernels; high-performance computing; NETWORK ALIGNMENT; ISOMORPHISM; ALGORITHM; SYSTEMS; REPLAY;
D O I
10.1177/10943420231166610
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The convergence of extremely high levels of hardware concurrency and the effective overlap of computation and communication in asynchronous executions has resulted in increasing nondeterminism in High-Performance Computing (HPC) applications. Nondeterminism can manifest at multiple levels: from low-level communication primitives to libraries to application-level functions. No matter its source, nondeterminism can drastically increase the cost of result reproducibility, debugging workflows, testing parallel programs, or ensuring fault-tolerance. Nondeterministic executions of HPC applications can be modeled as event graphs, and the applications' nondeterministic behavior can be understood and, in some cases, mitigated using graph comparison algorithms. However, a connection between graph comparison algorithms and approaches to understanding nondeterminism in HPC still needs to be established. This survey article moves the first steps toward establishing a connection between graph comparison algorithms and nondeterminism in HPC with its three contributions: it provides a survey of different graph comparison algorithms and a timeline for each category's significant works; it discusses how existing graph comparison methods do not fully support properties needed to understand nondeterministic patterns in HPC applications; and it presents the open challenges that should be addressed to leverage the power of graph comparisons for the study of nondeterminism in HPC applications.
引用
收藏
页码:306 / 327
页数:22
相关论文
共 137 条
[1]  
Abdelhamid E, 2016, SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, P716, DOI 10.1109/SC.2016.60
[2]   Printed circuit boards isomorphism: An experimental study [J].
Abiad, Aida ;
Grigoriev, Alexander ;
Niemzok, Stefanie .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 148
[3]  
Ahn DongH., 2013, Proceedings of the 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering, SE-HPCCSE'13, P41
[4]  
Akutsu Tatsuya, 2013, Comput Struct Biotechnol J, V5, pe201302004, DOI 10.5936/csbj.201302004
[5]   Alignment-free protein interaction network comparison [J].
Ali, Waqar ;
Rito, Tiago ;
Reinert, Gesine ;
Sun, Fengzhu ;
Deane, Charlotte M. .
BIOINFORMATICS, 2014, 30 (17) :I430-I437
[6]   Inferring Hierarchical Motifs from Execution Traces [J].
Alimadadi, Saba ;
Mesbah, Ali ;
Pattabiraman, Karthik .
PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, :776-787
[7]   BEAMS: backbone extraction and merge strategy for the global many-to-many alignment of multiple PPI networks [J].
Alkan, Ferhat ;
Erten, Cesim .
BIOINFORMATICS, 2014, 30 (04) :531-539
[8]  
[Anonymous], 2015, Advances in Neural Information Processing Systems
[9]   Sequential and Parallel Solution-Biased Search for Subgraph Algorithms [J].
Archibald, Blair ;
Dunlop, Fraser ;
Hoffmann, Ruth ;
McCreesh, Ciaran ;
Prosser, Patrick ;
Trimble, James .
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2019, 2019, 11494 :20-38
[10]  
Arifuzzaman S, 2015, PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, P1839, DOI 10.1109/BigData.2015.7363957