Extracting Domain Models from Textual Requirements in the Era of Large Language Models

被引:11
作者
Arulmohan, Sathurshan [1 ,2 ]
Meurs, Marie-Jean [3 ]
Mosser, Sebastien [1 ,2 ]
机构
[1] McMaster Univ, CAS, Hamilton, ON, Canada
[2] McSCert, Hamilton, ON, Canada
[3] Univ Quebec Montreal, CIRST, Montreal, PQ, Canada
来源
2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C | 2023年
基金
加拿大自然科学与工程研究理事会;
关键词
Domain Modeling; Natural Language Processing; Large Language Models; Concept Extraction; User stories;
D O I
10.1109/MODELS-C59198.2023.00096
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Requirements Engineering is a critical part of the software lifecycle, describing what a given piece of software will do (functional) and how it will do it (non-functional). Requirements documents are often textual, and it is up to software engineers to extract the relevant domain models from the text, which is an error-prone and time-consuming task. Considering the recent attention gained by Large Language Models (LLMs), we explored how they could support this task. This paper investigates how such models can be used to extract domain models from agile product backlogs and compare them to (i) a state-of-practice tool as well as (ii) a dedicated Natural Language Processing (NLP) approach, on top of a reference dataset of 22 products and 1, 679 user stories. Based on these results, this paper is a first step towards using LLMs and/or tailored NLP to support automated requirements engineering thanks to model extraction using artificial intelligence.
引用
收藏
页码:580 / 587
页数:8
相关论文
共 17 条
  • [1] Arulmohan Sathurshan, 2023, Zenodo, DOI 10.5281/ZENODO.8136975
  • [2] MUCE: a multilingual use case model extractor using GPT-3
    Bajaj D.
    Goel A.
    Gupta S.C.
    Batra H.
    [J]. International Journal of Information Technology, 2022, 14 (3) : 1543 - 1554
  • [3] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
    Bender, Emily M.
    Gebru, Timnit
    McMillan-Major, Angelina
    Shmitchell, Shmargaret
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 610 - 623
  • [4] Brown TB, 2020, ADV NEUR IN, V33
  • [5] Cohn M., 2004, USER STORIES APPL AG
  • [6] Dalpiaz F., 2018, Requirements Data Sets (User Stories)
  • [7] Natural Language Processing for Requirements Engineering The Best Is Yet to Come
    Dalpiaz, Fabiano
    Ferrari, Alessio
    Franch, Xavier
    Palomares, Cristina
    [J]. IEEE SOFTWARE, 2018, 35 (05) : 115 - 119
  • [8] Honnibal M., 2017, SPACY 2 NATURAL LANG
  • [9] Korobov Mikhail, 2023, Zenodo, DOI 10.5281/ZENODO.8132729
  • [10] Lafferty J., 2001, CONDITIONAL RANDOM F