Process discovery from event data: Relating models and logs through abstractions

Cited by: 34
Author
van der Aalst, Wil M. P. [1]
Affiliation
[1] Rhein Westfal TH Aachen, Proc & Data Sci PADS, Aachen, Germany
Keywords
business process management; data science; process discovery; process mining; process modeling; MINING PROCESS MODELS; OF-THE-ART
DOI
10.1002/widm.1244
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Event data are collected in logistics, manufacturing, finance, health care, customer relationship management, e-learning, e-government, and many other domains. The events found in these domains typically refer to activities executed by resources at particular times and for a particular case (i.e., a process instance). Process mining techniques are able to exploit such data. In this article, we focus on process discovery. However, process mining also includes conformance checking, performance analysis, decision mining, organizational mining, predictions, recommendations, and so on. These techniques help to diagnose problems and improve processes. All process mining techniques involve both event data and process models. Therefore, a typical first step is to automatically learn a control-flow model from the event data. This is very challenging, but in recent years, many powerful discovery techniques have been developed. It is not easy to compare these techniques since they use different representations and make different assumptions. Users often need to resort to trying different algorithms in an ad hoc manner. Developers of new techniques are often trying to solve specific instances of a more general problem. Therefore, we aim to unify existing approaches by focusing on log and model abstractions. These abstractions link observed and modeled behavior: concrete behaviors recorded in event logs are related to possible behaviors represented by process models. Hence, such behavioral abstractions provide an interface between the two. We discuss four discovery approaches involving three abstractions and different types of process models (Petri nets, block-structured models, and declarative models). The goal is to provide a comprehensive understanding of process discovery and show how to develop new techniques. Examples illustrate the different approaches, and pointers to software are given.
The discussion on abstractions and process representations is also presented to reflect on the gap between process mining literature and commercial process mining tools. This helps users select an appropriate process discovery technique. Moreover, structuring the role of internal abstractions and representations helps broaden the view and facilitates the creation of new discovery approaches.
This article is categorized under:
Algorithmic Development > Spatial and Temporal Data Mining
Application Areas > Business and Industry
Technologies > Machine Learning
Application Areas > Data Mining Software Tools
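The abstract's central idea, that an abstraction can serve as an interface between observed and modeled behavior, can be illustrated with a minimal sketch. The abstract does not name its three abstractions, so the directly-follows relation used here is an assumption chosen for illustration, and the log, the model traces, and the helper name `directly_follows` are all hypothetical.

```python
from collections import Counter


def directly_follows(traces):
    """Count how often activity a is immediately followed by activity b."""
    counts = Counter()
    for trace in traces:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


# Hypothetical event log: each trace lists the activities of one case in order.
log = [
    ["register", "check", "decide", "pay"],
    ["register", "check", "decide", "reject"],
    ["register", "decide", "check", "pay"],
]

# Hypothetical modeled behavior, given here simply as the traces a model allows.
model_traces = [
    ["register", "check", "decide", "pay"],
    ["register", "check", "decide", "reject"],
]

# Apply the same abstraction to both sides so they become comparable.
log_df = directly_follows(log)
model_df = directly_follows(model_traces)

# Directly-follows pairs seen in the log that the model's abstraction lacks.
unexplained = sorted(set(log_df) - set(model_df))
print(unexplained)
```

Because both the log and the model are reduced to the same kind of object, a set of activity pairs, they can be compared directly without committing to a particular model notation.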
Pages: 21
Related papers
50 in total
  • [31] Using Event Logs for Local Correction of Process Models
    Mitsyuk A.A.
    Lomazova I.A.
    van der Aalst W.M.P.
    Automatic Control and Computer Sciences, 2017, 51 (7) : 709 - 723
  • [32] Repairing Event Logs Using Timed Process Models
    Rogge-Solti, Andreas
    Mans, Ronny S.
    van der Aalst, Wil M. P.
    Weske, Mathias
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2013 WORKSHOPS, 2013, 8186 : 705 - 708
  • [33] Utility-Based Control Flow Discovery from Business Process Event Logs
    Anand, Kritika
    Gupta, Nisha
    Sureka, Ashish
    BIG DATA ANALYTICS, BDA 2015, 2015, 9498 : 69 - 83
  • [34] Process Model Discovery from Sensor Event Data
    Janssen, Dominik
    Mannhardt, Felix
    Koschmider, Agnes
    van Zelst, Sebastiaan J.
    PROCESS MINING WORKSHOPS, ICPM 2020 INTERNATIONAL WORKSHOPS, 2021, 406 : 69 - 81
  • [35] Information-preserving abstractions of event data in process mining
    Leemans, Sander J. J.
    Fahland, Dirk
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (03) : 1143 - 1197
  • [37] A Reference Data Model to Specify Event Logs for Big Data Pipeline Discovery
    Benvenuti, Dario
    Marrella, Andrea
    Rossi, Jacopo
    Nikolov, Nikolay
    Roman, Dumitru
    Soylu, Ahmet
    Perales, Fernando
    BUSINESS PROCESS MANAGEMENT FORUM, BPM 2023 FORUM, 2023, 490 : 38 - 54
  • [38] Online Discovery of Declarative Process Models from Event Streams
    Burattin, Andrea
    Cimitile, Marta
    Maggi, Fabrizio M.
    Sperduti, Alessandro
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2015, 8 (06) : 833 - 846
  • [39] Generating event logs for high-level process models
    Mitsyuk, Alexey A.
    Shugurov, Ivan S.
    Kalenkova, Anna A.
    van der Aalst, Wil M. P.
    SIMULATION MODELLING PRACTICE AND THEORY, 2017, 74 : 1 - 16
  • [40] Aligning event logs and process models based on Petri nets
    Tian Y.
    Du Y.
    Han D.
    Liu W.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (04) : 809 - 829