Approximate conformance checking: Fast computation of multi-perspective, probabilistic alignments

被引:0
作者
Gianola, Alessandro [1 ]
Ko, Jonghyeon [2 ]
Maggi, Fabrizio Maria [2 ]
Montali, Marco [2 ]
Winkler, Sarah [2 ]
机构
[1] Univ Lisbon, INESC ID Inst Super Tecn, Rua Alves Redol 9, P-1000019 Lisbon, Portugal
[2] Free Univ Bozen Bolzano, Piazza Domenicani 3, I-39100 Bolzano, BZ, Italy
关键词
Conformance checking; Trace encoding; Multi-perspective process mining; SMT;
D O I
10.1016/j.is.2024.102510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of process mining, alignments are increasingly being adopted for conformance checking, due to their ability in providing sophisticated diagnostics on the nature and extent of deviations between observed traces and a reference process model. On the downside, deriving alignments is challenging from the computational point of view, even more so when dealing with multiple perspectives in the process, such as, in particular, data. In fact, every observed trace must in principle be compared with infinitely many model traces. In this work, we tackle this computational bottleneck by borrowing the classical idea of encoding from machine learning. Instead of computing alignments directly and exactly, we do so in an approximate way after applying a lossy trace encoding that maps each trace into a corresponding compact, vectorial representation that retains only certain information of the original trace. We study trace encoding-based approximate alignments for processes equipped with event data attributes, from three different angles. First, we indeed show that computing approximate alignments in this way is much more efficient than in the exact setting. Second, we evaluate how accurate such approximate alignments are, considering different encoding strategies that focus on different features of the trace. Our findings suggest that sufficiently rich encodings actually yield good accuracy. Third, we consider the impact of frequency and density of model variants, comparing the effectiveness of using standard approximate multi-perspective alignments as opposed to a variant that incorporates probabilities. As a by-product of this analysis, we also obtain insights on how these two approaches perform in the presence of noise.
引用
收藏
页数:16
相关论文
共 42 条
  • [1] Sampling and approximation techniques for efficient process conformance checking
    Bauer, Martin
    van der Aa, Han
    Weidlich, Matthias
    [J]. INFORMATION SYSTEMS, 2022, 104
  • [2] Probabilistic Trace Alignment
    Bergami, Giacomo
    Maggi, Fabrizio Maria
    Montali, Marco
    Penaloza, Rafael
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2021), 2021, : 9 - 16
  • [3] Self-consistent method for density estimation
    Bernacchia, Alberto
    Pigolotti, Simone
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2011, 73 : 407 - 422
  • [4] Conformance Checking over Stochastically Known Logs
    Bogdanov, Eli
    Cohen, Izack
    Gal, Avigdor
    [J]. BUSINESS PROCESS MANAGEMENT FORUM, 2022, 458 : 105 - 119
  • [5] Formal Modeling and SMT-Based Parameterized Verification of Data-Aware BPMN
    Calvanese, Diego
    Ghilardi, Silvio
    Gianola, Alessandro
    Montali, Marco
    Rivkin, Andrey
    [J]. BUSINESS PROCESS MANAGEMENT (BPM 2019), 2019, 11675 : 157 - 175
  • [6] SMT-based verification of data-aware processes: a model-theoretic approach
    Calvanese, Diego
    Ghilardi, Silvio
    Gianola, Alessandro
    Montali, Marco
    Rivkin, Andrey
    [J]. MATHEMATICAL STRUCTURES IN COMPUTER SCIENCE, 2020, 30 (03) : 271 - 313
  • [7] Carmona Josep, 2018, Conformance Checking
  • [8] De Leoni M, 2015, 4TU.ResearchData
  • [9] de Leoni M., 2013, P 13 SAC
  • [10] Integrating BPMN and DMN: Modeling and Analysis
    de Leoni, Massimiliano
    Felli, Paolo
    Montali, Marco
    [J]. JOURNAL ON DATA SEMANTICS, 2021, 10 (1-2) : 165 - 188