Split miner: automated discovery of accurate and simple business process models from event logs

被引:0
作者
Adriano Augusto
Raffaele Conforti
Marlon Dumas
Marcello La Rosa
Artem Polyvyanyy
机构
[1] The University of Melbourne,
[2] University of Tartu,undefined
来源
Knowledge and Information Systems | 2019年 / 59卷
关键词
Process mining; Automated process discovery; Event log; BPMN;
D O I
暂无
中图分类号
学科分类号
摘要
The problem of automated discovery of process models from event logs has been intensively researched in the past two decades. Despite a rich field of proposals, state-of-the-art automated process discovery methods suffer from two recurrent deficiencies when applied to real-life logs: (i) they produce large and spaghetti-like models; and (ii) they produce models that either poorly fit the event log (low fitness) or over-generalize it (low precision). Striking a trade-off between these quality dimensions in a robust and scalable manner has proved elusive. This paper presents an automated process discovery method, namely Split Miner, which produces simple process models with low branching complexity and consistently high and balanced fitness and precision, while achieving considerably faster execution times than state-of-the-art methods, measured on a benchmark covering twelve real-life event logs. Split Miner combines a novel approach to filter the directly-follows graph induced by an event log, with an approach to identify combinations of split gateways that accurately capture the concurrency, conflict and causal relations between neighbors in the directly-follows graph. Split Miner is also the first automated process discovery method that is guaranteed to produce deadlock-free process models with concurrency, while not being restricted to producing block-structured process models.
引用
收藏
页码:251 / 284
页数:33
相关论文
共 50 条
[1]  
Adriansyah A(2015)Measuring precision of modeled behavior ISeB 13 37-67
[2]  
Muñoz-Gama J(2008)Business process control-flow complexity: metric, evaluation, and validation Int J Web Serv Res 5 49-76
[3]  
Carmona J(2010)Discovering exclusive patterns in frequent sequences Int J Data Min Model Manag 2 252-267
[4]  
van Dongen BF(2016)BPMN miner: automated discovery of BPMN process models with hierarchical structure Inf Syst 56 284-303
[5]  
van der Aalst WMP(2017)Filtering out infrequent behavior from business process event logs IEEE Trans Knowl Data Eng 29 300-314
[6]  
Cardoso JS(2012)A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs Inf Syst 37 654-676
[7]  
Chen W(1959)A note on two problems in connexion with graphs Numer Math 1 269-271
[8]  
Lu J(1997)Discovery of frequent episodes in event sequences Data Min Knowl Discov 1 259-289
[9]  
Keech M(2010)Seven process modeling guidelines (7PMG) Inf Softw Technol 52 127-136
[10]  
Conforti R(2006)Discovering frequent closed partial orders from strings IEEE Trans Knowl Data Eng 18 1467-1481