Graph neural networks for detecting anomalies in scientific workflows

Cited by: 2

Authors:

Jin, Hongwei ^{[1,6]}

Raghavan, Krishnan ^{[1]}

Papadimitriou, George ^{[2]}

Wang, Cong ^{[3]}

Mandal, Anirban ^{[3]}

Kiran, Mariam ^{[4]}

Deelman, Ewa ^{[2]}

Balaprakash, Prasanna ^{[5]}

Affiliations:

[1] Argonne Natl Lab, Lemont, IL USA

[2] Univ Southern Calif, Los Angeles, CA USA

[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA

[4] Energy Sci Network ESnet, Berkeley, CA USA

[5] Oak Ridge Natl Lab, Oak Ridge, TN USA

[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA

Source:

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2023, Vol. 37, Issue 3-4

Keywords:

Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;

DOI:

10.1177/10943420231172140

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Identifying and addressing anomalies in complex, distributed systems is a challenge for the reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. When jointly trained on multiple scientific workflows, our GNN models reached an accuracy of more than 80% for workflow-level anomalies and 75% for job-level anomalies. In addition, we illustrate the importance of hyperparameter tuning, which can significantly improve the metrics used to evaluate the GNN models. Finally, we integrate explainable GNN methods to provide insights into the job features in the workflow that cause an anomaly.
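The DAG modeling described in the abstract can be illustrated with a minimal, untrained sketch. It is not the authors' architecture: the job names, the two features, and the mean-aggregation rule are all hypothetical choices made for illustration, and a real GNN would add learned weights and nonlinearities. The sketch only shows how per-job features and workflow topology combine into node-level (job) and graph-level (workflow) representations.

```python
from typing import Dict, List

# Hypothetical toy workflow: each job maps to the jobs that depend on it (DAG edges).
edges: Dict[str, List[str]] = {
    "stage_in": ["process_a", "process_b"],
    "process_a": ["merge"],
    "process_b": ["merge"],
    "merge": [],
}

# Hypothetical per-job features: [runtime_seconds, data_read_gb].
features: Dict[str, List[float]] = {
    "stage_in": [10.0, 1.0],
    "process_a": [30.0, 2.0],
    "process_b": [90.0, 2.0],
    "merge": [20.0, 4.0],
}


def parents_of(job: str, edges: Dict[str, List[str]]) -> List[str]:
    """Return the jobs that the given job directly depends on."""
    return [src for src, dsts in edges.items() if job in dsts]


def message_pass(
    features: Dict[str, List[float]], edges: Dict[str, List[str]]
) -> Dict[str, List[float]]:
    """One mean-aggregation step: average a job's features with its neighbours'."""
    updated = {}
    for job, own in features.items():
        neighbours = parents_of(job, edges) + edges[job]
        stacked = [own] + [features[n] for n in neighbours]
        updated[job] = [sum(col) / len(stacked) for col in zip(*stacked)]
    return updated


def graph_readout(node_embeddings: Dict[str, List[float]]) -> List[float]:
    """Mean-pool all node embeddings into one workflow-level vector."""
    rows = list(node_embeddings.values())
    return [sum(col) / len(rows) for col in zip(*rows)]


# Node-level embeddings would feed a job-level anomaly classifier;
# the pooled vector would feed a workflow-level classifier.
node_embeddings = message_pass(features, edges)
graph_embedding = graph_readout(node_embeddings)
```

In this sketch, each job's embedding mixes its own features with those of its direct predecessors and successors, which is the sense in which topological information enters the prediction.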


Pages: 394-411

Page count: 18

50 records in total

[11] Detecting review fraud using metaheuristic graph neural networks
Oak R.
International Journal of Information Technology, 2024, 16 (7) : 4019 - 4025
[12] Detecting Synthesized Audio Files Using Graph Neural Networks
Izotova, O. A.
Lavrova, D. S.
AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2024, 58 (08) : 1212 - 1217
[13] Hybridization of Ontologies and Neural Networks in the Problems of Detecting Anomalies of Time Series
Moshkin, V. S.
Kurilo, D. S.
Andreev, I. A.
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 425 - 431
[14] Detecting Anomalies in Industrial Control Systems with LSTM Neural Networks and UEBA
Pinon-Blanco, Camilo
Otero-Vazquez, Fabian
Ortega-Fernandez, Ines
Sestelo, Marta
2023 JNIC CYBERSECURITY CONFERENCE, JNIC, 2023,
[17] Neural Pooling for Graph Neural Networks
Harsha, Sai Sree
Mishra, Deepak
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 171 - 180
[18] Graph neural networks and cross-protocol analysis for detecting malicious IP addresses
Huang, Yonghong
Negrete, Joanna
Wagener, John
Fralick, Celeste
Rodriguez, Armando
Peterson, Eric
Wosotowsky, Adam
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3857 - 3869
[19] Anomaly Detection in Scientific Workflows using End-to-End Execution Gantt Charts and Convolutional Neural Networks
Krawczuk, Patrycja
Papadimitriou, George
Nagarkar, Shubham
Kiran, Mariam
Mandal, Anirban
Deelman, Ewa
PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2021, PEARC 2021, 2021,
[20] Workflow Anomaly Detection with Graph Neural Networks
Jin, Hongwei
Raghavan, Krishnan
Papadimitriou, George
Wang, Cong
Mandal, Anirban
Krawczuk, Patrycja
Pottier, Loic
Kiran, Mariam
Deelman, Ewa
Balaprakash, Prasanna
2022 IEEE/ACM WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, WORKS, 2022, : 35 - 42
