Performance-preserving event log sampling for predictive monitoring

被引:7
作者
Sani, Mohammadreza Fani [1 ]
Vazifehdoostirani, Mozhgan [2 ]
Park, Gyunam [1 ]
Pegoraro, Marco [1 ]
van Zelst, Sebastiaan J. [1 ,3 ]
van der Aalst, Wil M. P. [1 ,3 ]
机构
[1] Rhein Westfal TH Aachen, Chair Proc & Data Sci, Aachen, Germany
[2] Eindhoven Univ Technol, Ind Engn & Innovat Sci, Eindhoven, Netherlands
[3] Fraunhofer FIT, Birlinghoven Castle, St Augustin, Germany
关键词
Process mining; Predictive monitoring; Sampling; Machine learning; Deep learning; Instance selection;
D O I
10.1007/s10844-022-00775-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive process monitoring is a subfield of process mining that aims to estimate case or event features for running process instances. Such predictions are of significant interest to the process stakeholders. However, most of the state-of-the-art methods for predictive monitoring require the training of complex machine learning models, which is often inefficient. Moreover, most of these methods require a hyper-parameter optimization that requires several repetitions of the training process which is not feasible in many real-life applications. In this paper, we propose an instance selection procedure that allows sampling training process instances for prediction models. We show that our instance selection procedure allows for a significant increase of training speed for next activity and remaining time prediction methods while maintaining reliable levels of prediction accuracy.
引用
收藏
页码:53 / 82
页数:30
相关论文
共 46 条
[1]   Natural language-based detection of semantic execution anomalies in event logs [J].
Aa, Han van der ;
Rebmann, Adrian ;
Leopold, Henrik .
INFORMATION SYSTEMS, 2021, 102
[2]   Time Matters: Time-Aware LSTMs for Predictive Business Process Monitoring [J].
An Nguyen ;
Chatterjee, Srijeet ;
SvenWeinzierl ;
Schwinn, Leo ;
Matzner, Martin ;
Eskofier, Bjoern .
PROCESS MINING WORKSHOPS, ICPM 2020 INTERNATIONAL WORKSHOPS, 2021, 406 :112-123
[3]  
Bergstra J., 2011, ADV NEURAL INFORM PR, P2546
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   COMPREHENSIBLE PREDICTIVE MODELS FOR BUSINESS PROCESSES [J].
Breuker, Dominic ;
Matzner, Martin ;
Delfmann, Patrick ;
Becker, Joerg .
MIS QUARTERLY, 2016, 40 (04) :1009-+
[6]  
Carmona Josep, 2012, Advances in Intelligent Data Analysis XI. Proceedings 11th International Symposium, IDA 2012, P90, DOI 10.1007/978-3-642-34156-4_10
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   A general process mining framework for correlating, predicting and clustering dynamic behavior based on event logs [J].
de Leoni, Massimiliano ;
van der Aalst, Wil M. P. ;
Dees, Marcus .
INFORMATION SYSTEMS, 2016, 56 :235-257
[9]   Predictive Monitoring of Business Processes: A Survey [J].
Eduardo Marquez-Chamorro, Alfonso ;
Resinas, Manuel ;
Ruiz-Cortes, Antonio .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (06) :962-977
[10]   Predicting process behaviour using deep learning [J].
Evermann, Joerg ;
Rehse, Jana-Rebecca ;
Fettke, Peter .
DECISION SUPPORT SYSTEMS, 2017, 100 :129-140