Event-Log Abstraction using Batch Session Identification and Clustering

被引:25
作者
de Leoni, Massimiliano [1 ]
Dundar, Safa [2 ]
机构
[1] Univ Padua, Dept Math, Padua, Italy
[2] Micro Focus, Utrecht, Netherlands
来源
PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20) | 2020年
关键词
Process Discovery; Event Log Abstraction; Clustering; Flexible Processes; Visual Analytics;
D O I
10.1145/3341105.3373861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process-Mining techniques aim to use event data about past executions to gain insight into how processes are executed. While these techniques are proven to be very valuable, they are less successful to reach their goal if the process is flexible and, hence, it exhibits an extremely large number of variants. Furthermore, information systems can record events at very low level, which do not match the high-level concepts known at business level. Without abstracting sequences of events to high-level concepts, the results of applying process mining (to, e.g., discover a model) easily become very complex and difficult to interpret, which ultimately means that they are of little use. A large body of research exists on event abstraction but typically a large amount of domain knowledge is required, which is often not readily available. Other abstraction techniques are unsupervised, which ultimately return less accurate results and/or rely on stronger assumptions. This paper puts forward a technique that requires limited domain knowledge that can be easily provided. Traces are divided in batch sessions, and each session is abstracted as one single high-level activity execution. The abstraction is based on a combination of automatic clustering and visualization methods. The technique was assessed on two case studies about processes characterized by high variability. The results clearly illustrate the benefits of the abstraction to convey accurate knowledge to stakeholders.
引用
收藏
页码:36 / 44
页数:9
相关论文
共 21 条
  • [1] [Anonymous], 2010, LECT NOTES BUS INF
  • [2] [Anonymous], 2019, ABS190203616 CORR
  • [3] Baier Thomas, 2015, THESIS U POTSDAM
  • [4] de Leoni Massimiliano, 2019, LOW LEVEL EVENTS ACT
  • [5] De Weerdt Jochen, 2018, TRACE CLUSTERING, DOI 10.1007/978-3-319-63962-8_91-1
  • [6] A Probabilistic Unified Framework for Event Abstraction and Process Detection from Log Data
    Fazzinga, Bettina
    Flesca, Sergio
    Furfaro, Filippo
    Masciari, Elio
    Pontieri, Luigi
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2015 CONFERENCES, 2015, 9415 : 320 - 328
  • [7] Ferreira Diogo R., 2013, International Journal of Business Process Integration and Management, V6, P146
  • [8] Ketchen DJ, 1996, STRATEGIC MANAGE J, V17, P441, DOI 10.1002/(SICI)1097-0266(199606)17:6<441::AID-SMJ819>3.0.CO
  • [9] 2-G
  • [10] Discovery of Frequent Episodes in Event Logs
    Leemans, Maikel
    van der Aalst, Wil M. P.
    [J]. DATA-DRIVEN PROCESS DISCOVERY AND ANALYSIS, SIMPDA 2014, 2015, 237 : 1 - 31