Multi-task prediction method of business process based on BERT and Transfer Learning

被引:21
作者
Chen, Hang [1 ,2 ]
Fang, Xianwen [1 ,2 ]
Fang, Huan [1 ]
机构
[1] Anhui Univ Sci & Technol, Sch Math & Big Data, Huainan, Peoples R China
[2] Anhui Prov Engn Lab Big Data Anal & Early Warning, Huainan, Peoples R China
关键词
Predictive business process monitoring; Transfer Learning; Transformer; BERT; Masked Activity Model; NEURAL-NETWORKS; CLASSIFIERS;
D O I
10.1016/j.knosys.2022.109603
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictive Business Process Monitoring (PBPM) is one of the essential tasks in Business Process Management (BPM). It aims to predict the future behavior of an ongoing case using completed cases of a process stored in the event log, such as the prediction of the next activity and outcome of the case, etc. Although various deep learning methods have been proposed for PBPM, none of them consider the simultaneous application to multiple predictive tasks. This paper proposes a multi-task prediction method based on BERT and Transfer Learning. First, the method performs the Masked Activity Model (MAM) of a self-supervised pre-training task on many unlabeled traces using BERT (Bidirectional Encoder Representations from Transformers). The pre-training task MAM captures the bidirectional semantic information of the input traces using the bidirectional Transformer structure in BERT. It obtains the long-term dependencies between activities using the Attention mechanism in the Transformer. Then, the universal representation model of the traces is obtained. Finally, two different models are defined for two prediction tasks of the next activity and the outcome of the case, respectively, and the pre-trained model is transferred to the two prediction models for training using the fine-tuning strategy. Experiments evaluation on eleven real-world event logs shows that the performance of the prediction tasks is affected by different masking tactics and masking probabilities in the pre-training task MAM. This method performs well in the next activity prediction task and the case outcome prediction task. It can be applied to several different prediction tasks faster and with more outstanding performance than the direct training method. (C) 2022 Published by Elsevier B.V.
引用
收藏
页数:15
相关论文
共 34 条
  • [11] Predicting process behaviour using deep learning
    Evermann, Joerg
    Rehse, Jana-Rebecca
    Fettke, Peter
    [J]. DECISION SUPPORT SYSTEMS, 2017, 100 : 129 - 140
  • [12] Learning Effective Neural Nets for Outcome Prediction from Partially Labelled Log Data
    Folino, Francesco
    Folino, Gianluigi
    Guarascio, Massimo
    Pontieri, Luigi
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1396 - 1400
  • [14] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [15] NEURAL NETWORKS AND PHYSICAL SYSTEMS WITH EMERGENT COLLECTIVE COMPUTATIONAL ABILITIES
    HOPFIELD, JJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1982, 79 (08): : 2554 - 2558
  • [16] HAM-Net: Predictive Business Process Monitoring with a hierarchical attention mechanism
    Jalayer, Abdulrahman
    Kahani, Mohsen
    Pourmasoumi, Asef
    Beheshti, Amin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [17] Kingma DP, 2014, ADV NEUR IN, V27
  • [18] A markov prediction model for data-driven semi-structured business processes
    Lakshmanan, Geetika T.
    Shamsi, Davood
    Doganata, Yurdaer N.
    Unuvar, Merve
    Khalaf, Rania
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 42 (01) : 97 - 126
  • [19] INTERPRETABLE CLASSIFIERS USING RULES AND BAYESIAN ANALYSIS: BUILDING A BETTER STROKE PREDICTION MODEL
    Letham, Benjamin
    Rudin, Cynthia
    McCormick, Tyler H.
    Madigan, David
    [J]. ANNALS OF APPLIED STATISTICS, 2015, 9 (03) : 1350 - 1371
  • [20] Maggi FM, 2014, LECT NOTES COMPUT SC, V8484, P457, DOI 10.1007/978-3-319-07881-6_31