Towards an Entropy-Based Analysis of Log Variability

被引:3
作者
Back, Christoffer Olling [1 ]
Debois, Soren [2 ]
Slaats, Tijs [1 ]
机构
[1] Univ Copenhagen, Dept Comp Sci, Emil Holms Kanal 6, DK-2300 Copenhagen, Denmark
[2] IT Univ Copenhagen, Dept Comp Sci, Rued Langgaards Vej 7, DK-2300 Copenhagen, Denmark
来源
BUSINESS PROCESS MANAGEMENT WORKSHOPS (BPM 2017) | 2018年 / 308卷
关键词
Process mining; Hybrid models; Process variability Process flexibility; Information theory; Entropy; Knowledge Work; MODELS;
D O I
10.1007/978-3-319-74030-0_4
中图分类号
F [经济];
学科分类号
02 ;
摘要
Process mining algorithms can be partitioned by the type of model that they output: imperative miners output flow-diagrams showing all possible paths through a process, whereas declarative miners output constraints showing the rules governing a process. For processes with great variability, the latter approach tends to provide better results, because using an imperative miner would lead to so-called "spaghetti models" which attempt to show all possible paths and are impossible to read. However, studies have shown that one size does not fit all: many processes contain both structured and unstructured parts and therefore do not fit strictly in one category or the other. This has led to the recent introduction of hybrid miners, which aim to combine flow-and constraint-based models to provide the best possible representation of a log. In this paper we focus on a core question underlying the development of hybrid miners: given a log, can we determine a priori whether the log is best suited for imperative or declarative mining? We propose using the concept of entropy, commonly used in information theory. We consider different measures for entropy that could be applied and show through experimentation on both synthetic and real-life logs that these entropy measures do indeed give insights into the complexity of the log and can act as an indicator of which mining paradigm should be used.
引用
收藏
页码:53 / 70
页数:18
相关论文
共 31 条
  • [1] Bansal N, 2002, ANN IEEE SYMP FOUND, P238, DOI 10.1109/SFCS.2002.1181947
  • [2] Bishop Christopher M, 2016, Pattern recognition and machine learning
  • [3] Bose R. J. C., 2009, Proceedings of the SIAM International Conference on Data Mining SDM 2009, P401, DOI DOI 10.1137/1.9781611972795.35
  • [4] COMPREHENSIBLE PREDICTIVE MODELS FOR BUSINESS PROCESSES
    Breuker, Dominic
    Matzner, Martin
    Delfmann, Patrick
    Becker, Joerg
    [J]. MIS QUARTERLY, 2016, 40 (04) : 1009 - +
  • [5] Buijs J.C.A.M., 2012, On the Move to Meaningful Internet Systems: OTM 2012, P305, DOI DOI 10.1007/978-3-642-33606-5_19
  • [6] Fusion Miner: Process discovery for mixed-paradigm models
    De Smedt, Johannes
    De Weerdt, Jochen
    Vanthienen, Jan
    [J]. DECISION SUPPORT SYSTEMS, 2015, 77 : 123 - 136
  • [7] Debois S., 2018, BUSINESS PROCESS MAN, P397, DOI DOI 10.1007/978-3-319-58307-521
  • [8] The Analysis of a Real Life Declarative Process
    Debois, Soren
    Slaats, Tijs
    [J]. 2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 1374 - 1382
  • [9] Safety, Liveness and Run-Time Refinement for Modular Process-Aware Information Systems with Dynamic Sub Processes
    Debois, Soren
    Hildebrandt, Thomas
    Slaats, Tijs
    [J]. FM 2015: FORMAL METHODS, 2015, 9109 : 143 - 160
  • [10] Delias P., 2017, INT T OPER RES