Measuring Fitness and Precision of Automatically Discovered Process Models: A Principled and Scalable Approach

被引:17
作者
Augusto, Adriano [1 ,2 ]
Conforti, Raffaele [2 ]
Armas-Cervantes, Abel [2 ]
Dumas, Marlon [1 ]
La Rosa, Marcello [2 ]
机构
[1] Univ Tartu, EE-50090 Tartu, Estonia
[2] Univ Melbourne, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
Process mining; automated process discovery; conformance checking; fitness; precision; CONFORMANCE CHECKING; AUTOMATED DISCOVERY; EVENT LOGS; ALGORITHMS;
D O I
10.1109/TKDE.2020.3003258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated process discovery techniques allow us to generate a process model from an event log consisting of a collection of business process execution traces. The quality of process models generated by these techniques can be assessed with respect to several criteria, including fitness, which captures the degree to which the generated process model is able to recognize the traces in the event log, and precision, which captures the extent to which the behavior allowed by the process model is observed in the event log. A range of fitness and precision measures have been proposed in the literature. However, existing measures in this field do not fulfil basic monotonicity properties and/or they suffer from scalability issues when applied to models discovered from real-life event logs. This article presents a family of fitness and precision measures based on the idea of comparing the kth order Markovian abstraction of a process model against that of an event log. The article shows that this family of measures fulfils the aforementioned properties for suitably chosen values of k. An empirical evaluation shows that representative exemplars of this family of measures yield intuitive results on a synthetic dataset of model-log pairs, while outperforming existing measures of fitness and precision in terms of execution times on real-life event logs.
引用
收藏
页码:1870 / 1888
页数:19
相关论文
共 38 条
  • [1] Measuring precision of modeled behavior
    Adriansyah, A.
    Munoz-Gama, J.
    Carmona, J.
    van Dongen, B. F.
    van der Aalst, W. M. P.
    [J]. INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2015, 13 (01) : 37 - 67
  • [2] Conformance Checking using Cost-Based Fitness Analysis
    Adriansyah, A.
    van Dongen, B. F.
    van der Aalst, W. M. P.
    [J]. 15TH IEEE INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE (EDOC 2011), 2011, : 55 - 64
  • [3] Aguirre Santiago, 2017, International Journal of Business Process Integration and Management, V8, P102
  • [4] Alves de Medeiros A.K., 2006, THESIS EINDHOVEN U T
  • [5] Split miner: automated discovery of accurate and simple business process models from event logs
    Augusto, Adriano
    Conforti, Raffaele
    Dumas, Marlon
    La Rosa, Marcello
    Polyvyanyy, Artem
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (02) : 251 - 284
  • [6] Automated Discovery of Process Models from Event Logs: Review and Benchmark
    Augusto, Adriano
    Conforti, Raffaele
    Dumas, Marlon
    La Rosa, Marcello
    Maggi, Fabrizio Maria
    Marrella, Andrea
    Mecella, Massimo
    Soo, Allar
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 686 - 705
  • [7] Automated discovery of structured process models from event logs: The discover-and-structure approach
    Augusto, Adriano
    Conforti, Raffaele
    Dumas, Marlon
    La Rosa, Marcello
    Bruno, Giorgio
    [J]. DATA & KNOWLEDGE ENGINEERING, 2018, 117 : 373 - 392
  • [8] Abstract-and-Compare: A Family of Scalable Precision Measures for Automated Process Discovery
    Augusto, Adriano
    Armas-Cervantes, Abel
    Conforti, Raffaele
    Dumas, Marlon
    La Rosa, Marcello
    Reissner, Daniel
    [J]. BUSINESS PROCESS MANAGEMENT (BPM 2018), 2018, 11080 : 158 - 175
  • [9] Edit Distance Cannot Be Computed in Strongly Subquadratic Time (unless SETH is false)
    Backurs, Arturs
    Indyk, Piotr
    [J]. STOC'15: PROCEEDINGS OF THE 2015 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2015, : 51 - 58
  • [10] Fodina: A robust and flexible heuristic process discovery technique
    Broucke, Seppe K. L. M. Vanden
    De Weerdt, Jochen
    [J]. DECISION SUPPORT SYSTEMS, 2017, 100 : 109 - 118