Multi-task prediction method of business process based on BERT and Transfer Learning

Cited by: 21

Authors:

Chen, Hang [1,2]

Fang, Xianwen [1,2]

Fang, Huan [1]

Affiliations:

[1] Anhui Univ Sci & Technol, Sch Math & Big Data, Huainan, Peoples R China

[2] Anhui Prov Engn Lab Big Data Anal & Early Warning, Huainan, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2022 / Vol. 254

Keywords:

Predictive business process monitoring; Transfer Learning; Transformer; BERT; Masked Activity Model; NEURAL-NETWORKS; CLASSIFIERS

DOI:

10.1016/j.knosys.2022.109603

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Predictive Business Process Monitoring (PBPM) is one of the essential tasks in Business Process Management (BPM). It aims to predict the future behavior of an ongoing case, such as its next activity or its outcome, using the completed cases of a process stored in the event log. Although various deep learning methods have been proposed for PBPM, none of them considers simultaneous application to multiple predictive tasks. This paper proposes a multi-task prediction method based on BERT (Bidirectional Encoder Representations from Transformers) and Transfer Learning. First, the method uses BERT to perform a self-supervised pre-training task, the Masked Activity Model (MAM), on many unlabeled traces. MAM captures the bidirectional semantic information of the input traces through the bidirectional Transformer structure in BERT, and it captures the long-term dependencies between activities through the Attention mechanism in the Transformer. This pre-training yields a universal representation model of the traces. Finally, two separate models are defined for the two prediction tasks, next activity and case outcome, and the pre-trained model is transferred to both prediction models and trained with a fine-tuning strategy. Experimental evaluation on eleven real-world event logs shows that the performance of the prediction tasks is affected by the masking tactic and the masking probability used in the MAM pre-training task. The method performs well on both the next activity prediction task and the case outcome prediction task. Compared with direct training, it can be applied to several different prediction tasks faster and with better performance. © 2022 Published by Elsevier B.V.
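
As a rough illustration of the masking step behind a Masked Activity Model, the sketch below hides activities in a trace at random and records the originals as prediction targets. It is not the authors' implementation: the function name `mask_trace`, the `[MASK]` token string, the default probability of 0.15, and the rule that every trace keeps at least one masked position are all assumptions made for this example, since the abstract does not specify the masking tactics it evaluates.

```python
import random

MASK = "[MASK]"  # hypothetical mask token; the paper's token name is not given


def mask_trace(trace, mask_prob=0.15, rng=None):
    """Mask activities in one trace for a BERT-style pre-training task.

    Returns (masked_trace, labels). labels[i] holds the original activity
    at masked positions and None everywhere else.
    """
    rng = rng or random.Random()
    masked, labels = [], []
    for activity in trace:
        if rng.random() < mask_prob:
            masked.append(MASK)       # hide the activity from the model
            labels.append(activity)   # keep it as the prediction target
        else:
            masked.append(activity)
            labels.append(None)       # position is not scored
    # Assumption: make sure each non-empty trace has at least one target.
    if trace and all(label is None for label in labels):
        i = rng.randrange(len(trace))
        masked[i], labels[i] = MASK, trace[i]
    return masked, labels


if __name__ == "__main__":
    example = ["Register", "Check", "Approve", "Pay", "Close"]
    print(mask_trace(example, mask_prob=0.3, rng=random.Random(42)))
```

Varying `mask_prob`, or swapping the random choice for a different selection rule, is one simple way to experiment with the masking probabilities and tactics that the abstract reports as influencing downstream performance.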

References

Pages: 15

34 entries in total

