Leveraging Generative Vision Models for Extracting Process Models from Documents

被引:0
作者
Voelter, Marvin [1 ,2 ]
Hadian, Raheleh [1 ]
Kampik, Timotheus [1 ]
Breitmayer, Marius [2 ]
Reichert, Manfred [2 ]
机构
[1] SAP, Berlin, Germany
[2] Ulm Univ, Ulm, Germany
来源
BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2024 | 2025年 / 534卷
关键词
Generative Vision Models; Multimodal Large Language Models; Document Analysis; Business Process Management;
D O I
10.1007/978-3-031-78666-2_21
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper investigates the vision capabilities of multimodal Generative Pre-trained Transformers (GPTs) to auto-generate structured process models from diagram- and text-based documents. We introduce a dataset of 123 process models and corresponding documentation, emphasizing real-world element distributions. Using evaluation metrics for process model similarity, this enables ground truth-based assessment of process model generation. We evaluate commercial GPT capabilities with zero-, one-, and few-shot prompting strategies. Our results indicate that generative vision models can be useful tools for semi-automated process modeling based on multimodal documents. More importantly, the dataset and evaluation metrics as well as the open-source evaluation code provide a structured framework for continued systematic evaluations moving forward.
引用
收藏
页码:271 / 282
页数:12
相关论文
共 28 条
  • [1] Antinori A., P BEST DISS AW DOCT
  • [2] Badakhshan P., 2022, Lecture Notes in Business Information Processing
  • [3] Beheshti Amin, 2023, 2023 IEEE International Conference on Web Services (ICWS), P731, DOI 10.1109/ICWS60048.2023.00099
  • [4] Bellan P., 2022, arXiv, DOI [10.48550/ARXIV.2204.03542, DOI 10.48550/ARXIV.2204.03542]
  • [5] PET: An Annotated Dataset for Process Extraction from Natural Language Text Tasks
    Bellan, Patrizio
    van der Aa, Han
    Dragoni, Mauro
    Ghidini, Chiara
    Ponzetto, Simone Paolo
    [J]. BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2022 INTERNATIONAL WORKSHOPS, 2023, 460 : 315 - 321
  • [6] Brown TB, 2020, ADV NEUR IN, V33
  • [7] Carmona J., 2018, P 27 INT C COMP LING, P2791
  • [8] Trends on the Usage of BPMN 2.0 from Publicly Available Repositories
    Compagnucci, Ivan
    Corradini, Flavio
    Fornari, Fabrizio
    Re, Barbara
    [J]. PERSPECTIVES IN BUSINESS INFORMATICS RESEARCH, BIR 2021, 2021, 430 : 84 - 99
  • [9] MEASURES OF THE AMOUNT OF ECOLOGIC ASSOCIATION BETWEEN SPECIES
    DICE, LR
    [J]. ECOLOGY, 1945, 26 (03) : 297 - 302
  • [10] Similarity of business process models: Metrics and evaluation
    Dijkman, Remco
    Dumas, Marlon
    van Dongen, Boudewijn
    Kaeaerik, Reina
    Mendling, Jan
    [J]. INFORMATION SYSTEMS, 2011, 36 (02) : 498 - 516