Graph neural networks for detecting anomalies in scientific workflows

被引：2

作者：

Jin, Hongwei ^{[1
,6
]}

Raghavan, Krishnan ^{[1
]}

Papadimitriou, George ^{[2
]}

Wang, Cong ^{[3
]}

Mandal, Anirban ^{[3
]}

Kiran, Mariam ^{[4
]}

Deelman, Ewa ^{[2
]}

Balaprakash, Prasanna ^{[5
]}

机构：

[1] Argonne Natl Lab, Lemont, IL USA

[2] Univ Southern Calif, Los Angeles, CA USA

[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA

[4] Energy Sci Network ESnet, Berkeley, CA USA

[5] Oak Ridge Natl Lab, Oak Ridge, TN USA

[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA

来源：

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2023年 / 37卷 / 3-4期

关键词：

Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;

D O I：

10.1177/10943420231172140

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Identifying and addressing anomalies in complex, distributed systems can be challenging for reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across the workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. While joint trained with multiple scientific workflows, our GNN models reached an accuracy more than 80% for workflow level and 75% for job level anomalies. In addition, we illustrate the importance of hyperparameter tuning method in our study that can significantly improve the metric(s) measure of evaluating the GNN models. Finally, we integrate explainable GNN methods to provide insights on job features in the workflow that cause an anomaly.

引用

页码：394 / 411

页数：18

共 50 条

[31] Detecting distant homologies on protozoans metabolic pathways using scientific workflows
Serra da Cruz, Sergio Manuel
Batista, Vanessa
Silva, Edno
Tosta, Frederico
Vilela, Clarissa
Cuadrat, Rafael
Tschoeke, Diogo
Davila, Alberto M. R.
Machado Campos, Maria Luiza
Mattoso, Marta
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (03) : 256 - 280
[32] Graph Neural Networks for Graph Drawing
Tiezzi, Matteo
Ciravegna, Gabriele
Gori, Marco
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4668 - 4681
[33] Graph Clustering with Graph Neural Networks
Tsitsulin, Anton
Palowitch, John
Perozzi, Bryan
Mueller, Emmanuel
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[34] On the Use of Heterogeneous Graph Neural Networks for Detecting Malicious Activities: a Case Study with Cryptocurrencies
Ferretti, Stefano
D'Angelo, Gabriele
Ghini, Vittorio
PROCEEDINGS OF THE 2024 WORKSHOP ON OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS, OASIS 2024, 2024, : 33 - 40
[35] GNNGLY: Graph Neural Networks for Glycan Classification
Alkuhlani, Alhasan
Gad, Walaa
Roushdy, Mohamed
Salem, Abdel-Badeeh M.
IEEE ACCESS, 2023, 11 : 51838 - 51847
[36] Bank Fraud Detection with Graph Neural Networks
A. I. Sergadeeva
D. S. Lavrova
D. P. Zegzhda
Automatic Control and Computer Sciences, 2022, 56 : 865 - 873
[37] Detecting Network Anomalies and Intrusions in Communication Networks
Rios, Ana Laura Gonzalez
Li, Zhida
Xu, Guangyu
Alonso, Alfonso Diaz
Trajkovic, Ljiljana
2019 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES 2019), 2019, : 29 - 34
[38] Bank Fraud Detection with Graph Neural Networks
Sergadeeva, A. I.
Lavrova, D. S.
Zegzhda, D. P.
AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (08) : 865 - 873
[39] Evaluation of Cybersecurity Data Set Characteristics for Their Applicability to Neural Networks Algorithms Detecting Cybersecurity Anomalies
Larriva-Novo, Xavier A.
Vega-Barbas, Mario
Villagra, Victor A.
Sanz Rodrigo, Mario
IEEE ACCESS, 2020, 8 : 9005 - 9014
[40] Process Discovery Using Graph Neural Networks
Sommers, Dominique
Menkovski, Vlado
Fahland, Dirk
2021 3RD INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2021), 2021, : 40 - 47

← 1 2 3 4 5 →