Agile Methodology for the Standardization of Engineering Requirements Using Large Language Models

被引:8
作者
Ray, Archana Tikayat [1 ]
Cole, Bjorn F. [2 ]
Fischer, Olivia Pinon J. [1 ]
Bhat, Anirudh Prabhakara [3 ]
White, Ryan T. [4 ]
Mavris, Dimitri N. [1 ]
机构
[1] Georgia Inst Technol, Sch Aerosp Engn, Aerosp Syst Design Lab, Atlanta, GA 30332 USA
[2] Lockheed Martin Space, Littleton, CO 80127 USA
[3] Amazon, Toronto, ON M5H 4A9, Canada
[4] Florida Inst Technol, Dept Math Sci, Neural Transmiss Lab, Melbourne, FL 32901 USA
来源
SYSTEMS | 2023年 / 11卷 / 07期
关键词
requirements engineering; Large Language Models (LLMs); transformer-based language models; Natural Language Processing (NLP); BERT; requirement boilerplates; model-based systems engineering; requirement tables;
D O I
10.3390/systems11070352
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
The increased complexity of modern systems is calling for an integrated and comprehensive approach to system design and development and, in particular, a shift toward Model-Based Systems Engineering (MBSE) approaches for system design. The requirements that serve as the foundation for these intricate systems are still primarily expressed in Natural Language (NL), which can contain ambiguities and inconsistencies and suffer from a lack of structure that hinders their direct translation into models. The colossal developments in the field of Natural Language Processing (NLP), in general, and Large Language Models (LLMs), in particular, can serve as an enabler for the conversion of NL requirements into machine-readable requirements. Doing so is expected to facilitate their standardization and use in a model-based environment. This paper discusses a two-fold strategy for converting NL requirements into machine-readable requirements using language models. The first approach involves creating a requirements table by extracting information from free-form NL requirements. The second approach consists of an agile methodology that facilitates the identification of boilerplate templates for different types of requirements based on observed linguistic patterns. For this study, three different LLMs are utilized. Two of these models are fine-tuned versions of Bidirectional Encoder Representations from Transformers (BERTs), specifically, aeroBERT-NER and aeroBERT-Classifier, which are trained on annotated aerospace corpora. Another LLM, called flair/chunk-english, is utilized to identify sentence chunks present in NL requirements. All three language models are utilized together to achieve the standardization of requirements. The effectiveness of the methodologies is demonstrated through the semi-automated creation of boilerplates for requirements from Parts 23 and 25 of Title 14 Code of Federal Regulations (CFRs).
引用
收藏
页数:28
相关论文
共 39 条
[1]   SOFTWARE REQUIREMENTS AND SPECIFICATIONS - A SURVEY OF NEEDS AND LANGUAGES [J].
ABBOTT, RJ ;
MOORHEAD, DK .
JOURNAL OF SYSTEMS AND SOFTWARE, 1981, 2 (04) :297-316
[2]  
Akbik A., 2018, COLING 2018, 27th International Conference on Computational Linguistics, P1638
[3]  
Arora Chetan, 2013, 2013 ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), P35, DOI 10.1109/ESEM.2013.13
[4]  
Ballard M., 2020, AEROSP CONF PROC, P1
[5]  
Bell T. E., 1976, 2nd International Conference on Software Engineering, P61
[6]  
Bengio Y., 2000, Advances in Neural Information Processing Systems
[7]  
Brown T, 2020, Adv Neural Inf Process Syst, V33, P1877
[8]   Requirements Classification with Interpretable Machine Learning and Dependency Parsing [J].
Dalpiaz, Fabiano ;
Dell'Anna, Davide ;
Aydemir, Fatma Basak ;
Cevikol, Sercan .
2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, :142-152
[9]   Natural Language Processing for Requirements Engineering The Best Is Yet to Come [J].
Dalpiaz, Fabiano ;
Ferrari, Alessio ;
Franch, Xavier ;
Palomares, Cristina .
IEEE SOFTWARE, 2018, 35 (05) :115-119
[10]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805