Graph neural networks for detecting anomalies in scientific workflows

Authors
Jin, Hongwei [1, 6]
Raghavan, Krishnan [1]
Papadimitriou, George [2]
Wang, Cong [3]
Mandal, Anirban [3]
Kiran, Mariam [4]
Deelman, Ewa [2]
Balaprakash, Prasanna [5]
Affiliations
[1] Argonne Natl Lab, Lemont, IL USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA
[4] Energy Sci Network ESnet, Berkeley, CA USA
[5] Oak Ridge Natl Lab, Oak Ridge, TN USA
[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA
Keywords
Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions
DOI
10.1177/10943420231172140
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Identifying and addressing anomalies in complex, distributed systems is a challenge for the reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models, which are trained on a particular workflow, and workflow-agnostic GNN models, which are trained across workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. When jointly trained on multiple scientific workflows, our GNN models reach an accuracy of more than 80% for workflow-level anomalies and 75% for job-level anomalies. In addition, we illustrate the importance of hyperparameter tuning in our study, which can significantly improve the metrics used to evaluate the GNN models. Finally, we integrate explainable GNN methods to provide insights into the job features in the workflow that cause an anomaly.
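The DAG formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the job names, the two features per job, and the unweighted mean aggregation are all assumptions made for illustration, and no training takes place. It only shows how per-job features and dependency structure can be combined into job-level and workflow-level representations, which a real GNN would then pass through learned layers and a classifier.

```python
from typing import Dict, List

# Hypothetical workflow: job name -> list of jobs it depends on (its parents).
deps: Dict[str, List[str]] = {
    "stage_in": [],
    "align": ["stage_in"],
    "sort": ["stage_in"],
    "merge": ["align", "sort"],
}

# Hypothetical per-job features: [runtime in seconds, data read in GB].
features: Dict[str, List[float]] = {
    "stage_in": [2.0, 4.0],
    "align": [4.0, 2.0],
    "sort": [6.0, 2.0],
    "merge": [8.0, 0.0],
}


def mean(vectors: List[List[float]]) -> List[float]:
    """Element-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def message_pass(
    deps: Dict[str, List[str]], feats: Dict[str, List[float]]
) -> Dict[str, List[float]]:
    """One round of message passing along dependency edges.

    Each job averages its own features with the mean of its parents'
    features; jobs without parents keep their features unchanged.
    """
    out: Dict[str, List[float]] = {}
    for job, parents in deps.items():
        if parents:
            neighborhood = mean([feats[p] for p in parents])
            out[job] = mean([feats[job], neighborhood])
        else:
            out[job] = list(feats[job])
    return out


def readout(feats: Dict[str, List[float]]) -> List[float]:
    """Workflow-level representation: mean over all job representations."""
    return mean(list(feats.values()))


node_emb = message_pass(deps, features)   # job-level (node-level) vectors
graph_emb = readout(node_emb)             # workflow-level (graph-level) vector
```

In this sketch, `node_emb` would feed a job-level anomaly classifier and `graph_emb` a workflow-level one; a trained GNN replaces the fixed averaging with learned weight matrices and nonlinearities.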
Pages: 394-411
Number of pages: 18