Data-driven dynamic causality analysis of industrial systems using interpretable machine learning and process mining

Cited by: 17
Authors
Nadim, Karim [1, 2]
Ragab, Ahmed [1, 2, 3]
Ouali, Mohamed-Salah [1]
Affiliations
[1] Polytech Montreal, Dept Math & Ind Engn, Montreal, PQ H3T 1J4, Canada
[2] CanmetENERGY Nat Resources Canada NRCan, Varennes, PQ J3X 1P7, Canada
[3] Menoufia Univ, Fac Elect Engn, Menoufia 32952, Egypt
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Causality analysis; Interpretable machine learning; Process mining; Petri nets; Discrete event systems; Supervisory control; ROOT CAUSE DIAGNOSIS; FAULT-DIAGNOSIS; CHEMICAL-PROCESSES; GRANGER CAUSALITY; PROCESS MODELS; SIGNED DIGRAPHS; TREE ANALYSIS; PETRI NETS; GRAPH; MAP;
DOI
10.1007/s10845-021-01903-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The complexity of industrial processes imposes a lot of challenges in building accurate and representative causal models for abnormal events diagnosis, control and maintenance of equipment and process units. This paper presents an innovative data-driven causality modeling approach using interpretable machine learning and process mining techniques, in addition to human expertise, to efficiently and automatically capture the complex dynamics of industrial systems. The approach tackles a significant challenge in the causality analysis community, which is the discovery of high-level causal models from low-level continuous observations. It is based on the exploitation of event data logs by analyzing the dependency relationships between events to generate accurate multi-level models that can take the form of various state-event diagrams. Highly accurate and trustworthy patterns are extracted from the original data using interpretable machine learning integrated with a model enhancement technique to construct event data logs. Afterward, the causal model is generated from the event log using the inductive miner technique, which is one of the most powerful process mining techniques. The causal model generated is a Petri net model, which is used to infer causality between important events as well as a visualization tool for real-time tracking of the system's dynamics. The proposed causality modeling approach has been successfully tested based on a real industrial dataset acquired from complex equipment in a Kraft pulp mill located in eastern Canada. The generated causality model was validated by ensuring high model fitness scores, in addition to the process expert's validation of the results.
Pages: 57-83
Page count: 27
Related papers
50 items in total
  • [31] Anomaly detection for industrial control systems using process mining
    Myers, David
    Suriadi, Suriadi
    Radke, Kenneth
    Foo, Ernest
    COMPUTERS & SECURITY, 2018, 78 : 103 - 125
  • [32] A Data-Driven Fault Tree for a Time Causality Analysis in an Aging System
    Waghen, Kerelous
    Ouali, Mohamed-Salah
    ALGORITHMS, 2022, 15 (06)
  • [33] Mining Learning Management System Data Using Interpretable Neural Networks
    Matetic, M.
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1282 - 1287
  • [34] Maritime-Accident-Induced Environmental Pollution and Economic Loss Analysis Using an Interpretable Data-Driven Method
    Zhang, Zeguo
    Hu, Qinyou
    Yin, Jianchuan
    SUSTAINABILITY, 2025, 17 (07)
  • [35] Explainable artificial intelligence and interpretable machine learning for agricultural data analysis
    Ryo, Masahiro
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2022, 6 : 257 - 265
  • [36] Towards Data-Driven Business Process Redesign Through the Lens of Process Mining Case Studies
    Wang, Zeping
    Syed, Rehan
    Ouyang, Chun
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2023, 2024, 492 : 259 - 271
  • [37] Interpretable machine learning methods for predictions in systems biology from omics data
    Sidak, David
    Schwarzerova, Jana
    Weckwerth, Wolfram
    Waldherr, Steffen
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [38] Investigation of lawsuit process duration using machine learning and process mining
    Luiz Vercosa
    Vinicius Silva
    Jaqueline Cruz
    Carmelo Bastos-Filho
    Byron L. D. Bezerra
    Discover Analytics, 2 (1):
  • [39] Environmental determinants of dynamic jogging patterns: Insights from trajectory big data analysis and interpretable machine learning
    Yang, Wei
    Fei, Jun
    Li, Jingjing
    Li, Wende
    Xie, Xuefeng
    APPLIED GEOGRAPHY, 2025, 178
  • [40] A data-driven Bayesian network learning method for process fault diagnosis
    Amin, Md Tanjin
    Khan, Faisal
    Ahmed, Salim
    Imtiaz, Syed
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2021, 150 : 110 - 122