act2vec, trace2vec, log2vec, and model2vec: Representation Learning for Business Processes

被引:54
作者
De Koninck, Pieter [1 ]
vanden Broucke, Seppe [1 ]
De Weerdt, Jochen [1 ]
机构
[1] Katholieke Univ Leuven, Dept Decis Sci & Informat Management, Fac Econ & Business, Leuven, Belgium
来源
BUSINESS PROCESS MANAGEMENT (BPM 2018) | 2018年 / 11080卷
关键词
Representation learning; Process mining; Word embedding;
D O I
10.1007/978-3-319-98648-7_18
中图分类号
F [经济];
学科分类号
02 ;
摘要
In process mining, the challenge is typically to turn raw event data into meaningful models, insights, or actions. One of the key problems of a data-driven analysis of processes, is the high dimensionality of the data. In this paper, we address this problem by developing representation learning techniques for business processes. More specifically, the representation learning paradigm is applied to activities, traces, logs, and models in order to learn highly informative but low-dimensional vectors, often referred to as embeddings, based on a neural network architecture. Subsequently, these vectors can be used for automated inference tasks such as trace clustering, process comparison, predictive process monitoring, anomaly detection, etc. Accordingly, the main contribution of this paper is the proposal of representation learning architectures at the level of activities, traces, logs, and models that can produce a distributed representation of these objects and a thorough analysis of potential applications. In an experimental evaluation, we show the power of such derived representations in the context of trace clustering and process model comparison.
引用
收藏
页码:305 / 321
页数:17
相关论文
共 35 条
[1]  
[Anonymous], 2013, EFFICIENT ESTIMATION
[2]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[3]  
Bose R. J. C., 2009, Proceedings of the SIAM International Conference on Data Mining SDM 2009, P401, DOI DOI 10.1137/1.9781611972795.35
[4]  
Bose RPJC, 2010, LECT NOTES BUS INF P, V43, P170
[5]  
Bose RPJC, 2009, LECT NOTES COMPUT SC, V5701, P159, DOI 10.1007/978-3-642-03848-8_12
[6]  
Burattin A., 2016, BPM (Demos), P1
[7]  
Cho K., 2014, P 2014 C EMP METH NA, P1724
[8]   Active Trace Clustering for Improved Process Discovery [J].
De Weerdt, Jochen ;
Vanden Broucke, Seppe ;
Vanthienen, Jan ;
Baesens, Bart .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (12) :2708-2720
[9]   Similarity of business process models: Metrics and evaluation [J].
Dijkman, Remco ;
Dumas, Marlon ;
van Dongen, Boudewijn ;
Kaeaerik, Reina ;
Mendling, Jan .
INFORMATION SYSTEMS, 2011, 36 (02) :498-516
[10]  
Dijkman R, 2009, LECT NOTES COMPUT SC, V5701, P48, DOI 10.1007/978-3-642-03848-8_5