A Novel Business Process Prediction Model Using a Deep Learning Method

被引:62
作者
Mehdiyev N. [1 ,2 ]
Evermann J. [3 ]
Fettke P. [1 ,2 ]
机构
[1] Institute for Information Systems (IWi), German Research Center for Artificial Intelligence (DFKI), Campus D3.2, Saarbruecken
[2] Saarland University, Saarbruecken
[3] Memorial University of Newfoundland, 310 Elizabeth Avenue, St. John’s, A1B 3X5, NL
关键词
Deep learning; Feature hashing; N-grams; Process prediction; Stacked autoencoders;
D O I
10.1007/s12599-018-0551-3
中图分类号
学科分类号
摘要
The ability to proactively monitor business processes is a main competitive differentiator for firms. Process execution logs generated by process aware information systems help to make process specific predictions for enabling a proactive situational awareness. The goal of the proposed approach is to predict the next process event from the completed activities of the running process instance, based on the execution log data from previously completed process instances. By predicting process events, companies can initiate timely interventions to address undesired deviations from the desired workflow. The paper proposes a multi-stage deep learning approach that formulates the next event prediction problem as a classification problem. Following a feature pre-processing stage with n-grams and feature hashing, a deep learning model consisting of an unsupervised pre-training component with stacked autoencoders and a supervised fine-tuning component is applied. Experiments on a variety of business process log datasets show that the multi-stage deep learning approach provides promising results. The study also compared the results to existing deep recurrent neural networks and conventional classification approaches. Furthermore, the paper addresses the identification of suitable hyperparameters for the proposed approach, and the handling of the imbalanced nature of business process event datasets. © 2018, Springer Fachmedien Wiesbaden GmbH, part of Springer Nature.
引用
收藏
页码:143 / 157
页数:14
相关论文
共 65 条
[1]  
Barga R., Fontama V., Tok W.H., Cabrera-Cordon L., Predictive analytics with Microsoft Azure machine learning, (2015)
[2]  
Bergstra J.S., Bardenet R., Bengio Y., Kegl B., Algorithms for hyper-parameter optimization, Advances in neural information processing systems, pp. 2546-2554, (2011)
[3]  
Bergstra J., Bengio Y., Random search for hyper-parameter optimization, J Mach Learn Res, 13, 1, pp. 281-305, (2012)
[4]  
Bose R.P.J.C., van Der Aalst W.M.P., Zliobaite I., Pechenizkiy M., Handling concept drift in process mining, International Conference on Advanced Information Systems Engineering, pp. 391-405, (2011)
[5]  
Bradley A.P., The use of the area under the ROC curve in the evaluation of machine learning algorithms, Pattern Recognit, 30, 7, pp. 1145-1159, (1997)
[6]  
Breuker D., Matzner M., Delfmann P., Becker J., Comprehensible predictive models for business processes, MIS Q, 40, 4, pp. 1009-1034, (2016)
[7]  
Candel A., Parmar V., LeDell E., Arora A., Deep learning with h2o, (2016)
[8]  
Caragea C., Silvescu A., Mitra P., Protein sequence classification using feature hashing, Proteome Sci, 10, 1, pp. 1-14, (2012)
[9]  
Caruana R., Karampatziakis N., Yessenalina A., An empirical evaluation of supervised learning in high dimensions, 25Th International Conference on Machine Learning, pp. 96-103, (2008)
[10]  
Caruana R., Niculescu-Mizil A., An empirical comparison of supervised learning algorithms, 23Rd International Conference on Machine Learning, pp. 161-168, (2006)