Tuning Machine Learning to Address Process Mining Requirements

被引:3
作者
Ceravolo, Paolo [1 ]
Barbon, Sylvio, Jr. [2 ]
Damiani, Ernesto [3 ]
van der Aalst, Wil [4 ]
机构
[1] Univ Milan, Dept Comp Sci, I-20133 Milan, Italy
[2] Univ Trieste, Dept Engn & Architecture, I-34127 Trieste, Italy
[3] Khalifa Univ, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[4] Rhein Westfal TH Aachen, Chair Proc & Data Sci, D-52056 Aachen, Germany
关键词
Process mining; machine learning; non-parametric distribution; concurrency; non-stationary; zero-shot learning; encoding; training;
D O I
10.1109/ACCESS.2024.3361650
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning models are routinely integrated into process mining pipelines to carry out tasks like data transformation, noise reduction, anomaly detection, classification, and prediction. Often, the design of such models is based on some ad-hoc assumptions about the corresponding data distributions, which are not necessarily in accordance with the non-parametric distributions typically observed with process data. Moreover, mainstream machine-learning approaches tend to ignore the challenges posed by concurrency in operational processes. Data encoding is a key element to smooth the mismatch between these assumptions but its potential is poorly exploited. In this paper, we argue that a deeper understanding of the challenges associated with training machine learning models on process data is essential for establishing a robust integration of process mining and machine learning. Our analysis aims to lay the groundwork for a methodology that aligns machine learning with process mining requirements. We encourage further research in this direction to advance the field and effectively address these critical issues.
引用
收藏
页码:24583 / 24595
页数:13
相关论文
共 97 条
[1]   A Framework for Explainable Concept Drift Detection in Process Mining [J].
Adams, Jan Niklas ;
van Zelst, Sebastiaan J. ;
Quack, Lara ;
Hausmann, Kathrin ;
van der Aalst, Wil M. P. ;
Rose, Thomas .
BUSINESS PROCESS MANAGEMENT (BPM 2021), 2021, 12875 :400-416
[2]   Using Machine Learning in Business Process Re-Engineering [J].
Al-Anqoudi, Younis ;
Al-Hamdani, Abdullah ;
Al-Badawi, Mohamed ;
Hedjam, Rachid .
BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (04)
[3]  
[Anonymous], 2020, Process Mining in Healthcare Tutorial
[4]  
[Anonymous], 1998, Independent Component Analysis
[5]   A Co-Training Strategy for Multiple View Clustering in Process Mining [J].
Appice, Annalisa ;
Malerba, Donato .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (06) :832-845
[6]   Handling Concept Drift for Predictions in Business Process Mining [J].
Baier, Lucas ;
Reimold, Josua ;
Kuhl, Niklas .
2020 IEEE 22ND CONFERENCE ON BUSINESS INFORMATICS (CBI 2020), VOL I - RESEARCH PAPERS, 2020, :76-83
[7]   Anomaly Detection on Event Logs with a Scarcity of Labels [J].
Barbon Junior, Sylvio ;
Ceravolo, Paolo ;
Damiani, Ernesto ;
Omori, Nicolas Jashchenko ;
Tavares, Gabriel Marques .
2020 2ND INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2020), 2020, :161-168
[8]   Validating Vector-Label Propagation for Graph Embedding [J].
Bellandi, Valerio ;
Damiani, Ernesto ;
Ghirimoldi, Valerio ;
Maghool, Samira ;
Negri, Fedra .
COOPERATIVE INFORMATION SYSTEMS (COOPIS 2022), 2022, 13591 :259-276
[9]  
Berti A., 2023, BPM, P427
[10]   PM4Py: A process mining library for Python']Python [J].
Berti, Alessandro ;
van Zelst, Sebastiaan ;
Schuster, Daniel .
SOFTWARE IMPACTS, 2023, 17