Sampling business process event logs with guarantees

Authors
Su, Xuan [1]
Liu, Cong [1, 2, 4]
Zhang, Shuaipeng [3]
Zeng, Qingtian [2]
Affiliations
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Peoples R China
[3] Shandong Univ, Sch Software, Jinan, Peoples R China
[4] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255000, Peoples R China
Source
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2024 / Vol. 36 / Issue 13
Keywords
process mining; model discovery; event log sampling; behavior equivalence; efficiency; PROCESS MODELS; DISCOVERY;
DOI
10.1002/cpe.8077
Chinese Library Classification number
TP31 [Computer software];
Subject classification code
081202 ; 0835 ;
Abstract
Event log sampling has emerged as a key research focus in process mining, aiming to improve the efficiency of process mining tasks such as model discovery, conformance checking, and process prediction. However, current log sampling techniques often fail to ensure high-quality sample logs. This paper introduces a novel framework that supports efficient event log sampling without compromising the quality of the sample log relative to the original one. The approach treats the directly-follows relation (DFR) among business tasks as the fundamental behavioral unit of an event log. By ensuring DFR equivalence between the original and sample logs, the proposed technique addresses the challenge of sample log quality from the model discovery point of view. The framework is instantiated by seven distinct sampling strategies, each with its own specialty, and is fully implemented in the open-source process mining platform ProM. To validate its effectiveness, we conducted a comprehensive experimental evaluation on 12 publicly available real-life event logs against state-of-the-art sampling techniques. The results demonstrate that our technique significantly improves model discovery efficiency while maintaining high quality of the discovered models.
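The idea of DFR equivalence described in the abstract can be illustrated with a minimal Python sketch. This is not one of the paper's seven strategies and not the ProM implementation. It is a simple greedy trace selection, chosen here only for illustration, and it assumes a log is a list of traces, each trace a tuple of activity names. The function names (`directly_follows`, `log_dfr`, `greedy_dfr_sample`) and the toy log are hypothetical. Start and end activities are ignored for simplicity.

```python
from typing import Iterable, List, Set, Tuple

Trace = Tuple[str, ...]
Pair = Tuple[str, str]


def directly_follows(trace: Trace) -> Set[Pair]:
    """Return all (a, b) pairs where b occurs immediately after a in the trace."""
    return set(zip(trace, trace[1:]))


def log_dfr(log: Iterable[Trace]) -> Set[Pair]:
    """Return the union of directly-follows pairs over every trace in the log."""
    pairs: Set[Pair] = set()
    for trace in log:
        pairs |= directly_follows(trace)
    return pairs


def greedy_dfr_sample(log: List[Trace]) -> List[Trace]:
    """Pick distinct traces until every directly-follows pair of the log is covered.

    Illustrative greedy heuristic (an assumption of this sketch, not the
    paper's method): repeatedly take the trace covering the most uncovered pairs.
    """
    variants = list(dict.fromkeys(log))  # distinct traces, first-seen order
    uncovered = log_dfr(variants)
    sample: List[Trace] = []
    while uncovered:
        best = max(variants, key=lambda t: len(directly_follows(t) & uncovered))
        sample.append(best)
        uncovered -= directly_follows(best)
    return sample


# Toy log (hypothetical data): four traces over activities a, b, c, d.
log = [
    ("a", "b", "c", "d"),
    ("a", "c", "b", "d"),
    ("a", "b", "c", "d"),
    ("a", "b", "d"),
]
sample = greedy_dfr_sample(log)

# The sample is smaller than the log but has exactly the same DFR set.
print(len(log), len(sample), log_dfr(sample) == log_dfr(log))
```

On this toy log the sample keeps two of the four traces while preserving every directly-follows pair, so a discovery algorithm that only consumes the set of DFRs would see the same input from either log. Pair frequencies are not preserved by this sketch.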
Pages: 18