Utilizing domain knowledge in data-driven process discovery: A literature review

Cited by: 32
Authors
Schuster, Daniel [1, 2]
van Zelst, Sebastiaan J. [1, 2]
van der Aalst, Wil M. P. [1, 2]
Affiliations
[1] Fraunhofer FIT, Proc Min Res Grp, D-53757 St Augustin, North Rhine Wes, Germany
[2] Rhein Westfal TH Aachen, Chair Proc & Data Sci, Ahornstr 55, D-52074 Aachen, North Rhine Wes, Germany
Keywords
Process mining; Process discovery; Process models; Human-in-the-loop; Hybrid intelligence; PROCESS MODELS; NETS;
DOI
10.1016/j.compind.2022.103612
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Process mining aims to improve operational processes in a data-driven manner. To this end, process mining offers methods and techniques for systematically analyzing event data. These data are generated during the execution of processes and stored in organizations' information systems. Process discovery, a key discipline in process mining, comprises techniques used to (automatically) learn a process model from event data. However, existing algorithms typically provide low-quality models from real-life event data due to data-quality issues and incompletely captured process behavior. Automated filtering of event data is valuable in obtaining better process models. At the same time, it is often too rigorous, i.e., it also removes valuable and correct data. In many cases, prior knowledge about the process under investigation can be additionally used for process discovery besides event data. Therefore, a new family of discovery algorithms has been developed that utilizes domain knowledge about the process in addition to event data. To organize this research, we present a literature review of process discovery approaches exploiting domain knowledge. We define a taxonomy that systematically classifies and compares existing approaches. Finally, we identify remaining challenges for future work. (C) 2022 The Author(s). Published by Elsevier B.V.
Pages: 19