An empirical study of business process models and model clones on GitHub

被引:0
作者
Nikoo, Mahdi Saeedi [1 ]
Kochanthara, Sangeeth [1 ]
Babur, Onder [1 ,2 ]
van den Brand, Mark [1 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Wageningen Univ & Res, Informat Technol Grp, Wageningen, Netherlands
关键词
Business process modeling; BPMN; Mining software repositories; Model analytics; Model clone detection; CODE; BPMN;
D O I
10.1007/s10664-024-10584-z
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Business process management entails a multi-billion-dollar industry that is founded on modeling business processes to analyze, understand, improve, and automate them. Business processes consist of a set of interconnected activities that an organization follows to achieve its goals and objectives. While the existence of business process models in open source has been reported in the literature, there is little work in characterizing their landscape. This paper presents the first characterization of business process models in open source, particularly on GitHub. The landscape is formed by 25,866 business process models across 4,954 repositories, with 16% of the repositories belonging to organizations. We discover that models belong to at least 16 domains including traditional software, machine learning, sales, business services, and financial services. These models are created using at least 28 different tools. Our exploration into cloning among the models shows that about 90% of all models are clones of each other. Application domains such as machine learning, traditional software, and business services demonstrate a higher occurrence of clones while in another dimension, clones are found across more repositories owned by industry as compared to those owned by academia. Also, contrary to code clones, we find that the majority of process model cloning occurs across multiple repositories. While our study acts as a precursor for future efforts to develop effective modeling practices in the field of business processes, it also emphasizes the need to address cloning and its implications in the context of reuse, maintenance, and modeling approaches.
引用
收藏
页数:46
相关论文
共 137 条
  • [1] Activiti, 2023, ABOUT US
  • [2] Alalfi MH, 2012, 2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), P295, DOI 10.1109/ICSM.2012.6405285
  • [3] The Adverse Effects of Code Duplication in Machine Learning Models of Code
    Allamams, Miltiadis
    [J]. PROCEEDINGS OF THE 2019 ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON NEW IDEAS, NEW PARADIGMS, AND REFLECTIONS ON PROGRAMMING AND SOFTWARE (ONWARD!' 19), 2019, : 143 - 153
  • [4] Recommender systems in model-driven engineering A systematic mapping review
    Almonte, Lissette
    Guerra, Esther
    Cantador, Ivan
    de Lara, Juan
    [J]. SOFTWARE AND SYSTEMS MODELING, 2022, 21 (01) : 249 - 280
  • [5] [Anonymous], 2010, P 4 INT WORKSH SOFTW, DOI [10.1145/1808901.1808909, DOI 10.1145/1808901.1808909]
  • [6] April A., 2012, Software Maintenance Management: Evaluation and Continuous Improvement
  • [7] Babur O, 2018, P MODELS 2018 WORKSH, P787
  • [8] Babur O, 2019, Model analytics and management
  • [9] SAMOS- A framework for model analytics and management
    Babur, Oender
    Cleophas, Loek
    van den Brand, Mark
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2022, 223
  • [10] Language usage analysis for EMF metamodels on GitHub
    Babur, Onder
    Constantinou, Eleni
    Serebrenik, Alexander
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (01)