Graph neural networks for detecting anomalies in scientific workflows

Cited by: 2
Authors
Jin, Hongwei [1, 6]
Raghavan, Krishnan [1]
Papadimitriou, George [2]
Wang, Cong [3]
Mandal, Anirban [3]
Kiran, Mariam [4]
Deelman, Ewa [2]
Balaprakash, Prasanna [5]
Affiliations
[1] Argonne Natl Lab, Lemont, IL USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA
[4] Energy Sci Network ESnet, Berkeley, CA USA
[5] Oak Ridge Natl Lab, Oak Ridge, TN USA
[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA
Keywords
Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions
DOI
10.1177/10943420231172140
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Identifying and addressing anomalies in complex, distributed systems can be challenging for the reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. When jointly trained on multiple scientific workflows, our GNN models reached an accuracy of more than 80% for workflow-level anomalies and 75% for job-level anomalies. In addition, we illustrate the importance of hyperparameter tuning in our study, which can significantly improve the metrics used to evaluate the GNN models. Finally, we integrate explainable GNN methods to provide insights into the job features in the workflow that cause an anomaly.
Pages: 394-411
Page count: 18