Split miner: automated discovery of accurate and simple business process models from event logs

被引:132
作者
Augusto, Adriano [1 ,4 ]
Conforti, Raffaele [2 ]
Dumas, Marlon [5 ]
La Rosa, Marcello [2 ]
Polyvyanyy, Artem [3 ]
机构
[1] Univ Melbourne, Melbourne, Vic, Australia
[2] Univ Melbourne, Informat Syst, Melbourne, Vic, Australia
[3] Univ Melbourne, Melbourne Sch Engn, Sch Comp & Informat Syst, Melbourne, Vic, Australia
[4] Univ Tartu, Tartu, Estonia
[5] Univ Tartu, Informat Syst, Tartu, Estonia
基金
澳大利亚研究理事会;
关键词
Process mining; Automated process discovery; Event log; BPMN;
D O I
10.1007/s10115-018-1214-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of automated discovery of process models from event logs has been intensively researched in the past two decades. Despite a rich field of proposals, state-of-the-art automated process discovery methods suffer from two recurrent deficiencies when applied to real-life logs: (i) they produce large and spaghetti-like models; and (ii) they produce models that either poorly fit the event log (low fitness) or over-generalize it (low precision). Striking a trade-off between these quality dimensions in a robust and scalable manner has proved elusive. This paper presents an automated process discovery method, namely Split Miner, which produces simple process models with low branching complexity and consistently high and balanced fitness and precision, while achieving considerably faster execution times than state-of-the-art methods, measured on a benchmark covering twelve real-life event logs. Split Miner combines a novel approach to filter the directly-follows graph induced by an event log, with an approach to identify combinations of split gateways that accurately capture the concurrency, conflict and causal relations between neighbors in the directly-follows graph. Split Miner is also the first automated process discovery method that is guaranteed to produce deadlock-free process models with concurrency, while not being restricted to producing block-structured process models.
引用
收藏
页码:251 / 284
页数:34
相关论文
共 37 条
[1]   Measuring precision of modeled behavior [J].
Adriansyah, A. ;
Munoz-Gama, J. ;
Carmona, J. ;
van Dongen, B. F. ;
van der Aalst, W. M. P. .
INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2015, 13 (01) :37-67
[2]  
Adriansyah A., 2011, P EDOC
[3]  
[Anonymous], 2017, ABS170502288 CORR
[4]  
[Anonymous], 2017, BPTRENDS NEWSLETTER
[5]  
Augusto A, 2016, LNCS, V9974
[6]  
Augusto A, 2017, P 17 IEEE INT C DAT
[7]   Fodina: A robust and flexible heuristic process discovery technique [J].
Broucke, Seppe K. L. M. Vanden ;
De Weerdt, Jochen .
DECISION SUPPORT SYSTEMS, 2017, 100 :109-118
[8]  
Buijs J. C. A. M., 2012, LNCS
[9]   Business process control-flow complexity: Metric, evaluation, and validation [J].
Cardoso, Jorge .
INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2008, 5 (02) :49-76
[10]   Discovering exclusive patterns in frequent sequences [J].
Chen, Weiru ;
Lu, Jing ;
Keech, Malcolm .
INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2010, 2 (03) :252-267