Bootstrapping Generalization of Process Models Discovered from Event Data

被引:5
作者
Polyvyanyy, Artem [1 ]
Moffat, Alistair [1 ]
Garcia-Banuelos, Luciano [2 ]
机构
[1] Univ Melbourne, Melbourne, Vic 3010, Australia
[2] Tecnol Monterrey, Monterrey 64849, NL, Mexico
来源
ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2022) | 2022年
基金
澳大利亚研究理事会;
关键词
Process mining; Generalization; Bootstrapping; Consistent estimator;
D O I
10.1007/978-3-031-07472-1_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Process mining extracts value from the traces recorded in the event logs of IT-systems, with process discovery the task of inferring a process model for a log emitted by some unknown system. Generalization is one of the quality criteria applied to process models to quantify how well the model describes future executions of the system. Generalization is also perhaps the least understood of those criteria, with that lack primarily a consequence of it measuring properties over the entire future behavior of the system when the only available sample of behavior is that provided by the log. In this paper, we apply a bootstrap approach from computational statistics, allowing us to define an estimator of the model's generalization based on the log it was discovered from. We show that standard process mining assumptions lead to a consistent estimator that makes fewer errors as the quality of the log increases. Experiments confirm the ability of the approach to support industry-scale data-driven systems engineering.
引用
收藏
页码:36 / 54
页数:19
相关论文
共 26 条
  • [1] Alkhammash H., 2020, GARCIA BANUELOS DISC, DOI [10.26188/12814535, DOI 10.26188/12814535]
  • [2] Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
  • [3] Quality Dimensions in Process Discovery: The Importance of Fitness, Precision, Generalization and Simplicity
    Buijs, J. C. A. M.
    van Dongen, B. F.
    van der Aalst, W. M. P.
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2014, 23 (01)
  • [4] Carmona J., 2018, Conformance Checking - Relating Processes and Models
  • [5] Efron B., 1993, An introduction to the bootstrap, DOI 10.1007/978-1-4899-4541-9
  • [6] Efron B., 1982, JACKKNIFE BOOTSTRAP, DOI DOI 10.1137/1.9781611970319.CH5
  • [7] Towards Confirmatory Process Discovery: Making Assertions About the Underlying System
    Janssenswillen, Gert
    Depaire, Benoit
    [J]. BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2019, 61 (06) : 713 - 728
  • [8] A comparative study of existing quality measures for process discovery
    Janssenswillen, Gert
    Donders, Niels
    Jouck, Toon
    Depaire, Benoit
    [J]. INFORMATION SYSTEMS, 2017, 71 : 1 - 15
  • [9] Leemans Sander J. J., 2013, Application and Theory of Petri Nets and Concurrency. 34th International Conference, PETRI NETS 2013. Proceedings: LNCS 7927, P311, DOI 10.1007/978-3-642-38697-8_17
  • [10] Scalable process discovery and conformance checking
    Leemans, Sander J. J.
    Fahland, Dirk
    Van der Aalst, Wil M. P.
    [J]. SOFTWARE AND SYSTEMS MODELING, 2018, 17 (02) : 599 - 631