Prompting GPT-4 to support automatic safety case generation

被引:7
作者
Sivakumar, Mithila [1 ]
Belle, Alvine B. [1 ]
Shan, Jinjun [1 ]
Shahandashti, Kimya Khakzad [1 ]
机构
[1] York Univ, Lassonde Sch Engn, 4700 Keele St, Toronto, ON M3J 1P3, Canada
关键词
Safety cases; Safety assurance; Machine learning; Large language models; Generative AI; Requirements engineering;
D O I
10.1016/j.eswa.2024.124653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the ever-evolving field of software engineering, the advent of large language models and conversational interfaces, exemplified by ChatGPT, represents a significant revolution. While their potential is evident in various domains, this paper expands upon our previous research, where we experimented with GPT -4, on its ability to create safety cases. A safety case is a structured argument supported by a body of evidence to demonstrate that a given system is safe to operate in a given environment. In this paper, we first determine GPT -4's comprehension of the Goal Structuring Notation (GSN), a well-established notation for visually representing safety cases. Additionally, we conduct four distinct experiments using GPT -4 to evaluate its ability to generate safety cases within a specified system and application domain. To assess GPT -4's performance in this context, we compare the results it produces with the ground-truth safety cases developed for an X-ray system, a machine learning-enabled component for tire noise recognition in a vehicle, and a lane management system from the automotive domain. This comparison enables us to gain valuable insights into the model's generative capabilities. Our findings indicate that GPT -4 is able to generate moderately accurate and reasonable safety cases.
引用
收藏
页数:18
相关论文
共 50 条
[31]   Using GPT-4 Turbo To Automatically Identify Defeaters In Assurance Cases [J].
Shahandashti, Kimya Khakzad ;
Belle, Alvine Boaye ;
Mohajer, Mohammad Mahdi ;
Odu, Oluwafemi ;
Lethbridge, Timothy C. ;
Hemmati, Hadi ;
Wang, Song .
32ND INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW 2024, 2024, :46-56
[32]   Assessing the Impact of GPT-4 Turbo in Generating Defeaters for Assurance Cases [J].
Shahandashti, Kimya Khakzad ;
Sivakumar, Mithila ;
Mohajer, Mohammad Mahdi ;
Belle, Alvine B. ;
Wang, Song ;
Lethbridge, Timothy C. .
PROCEEDINGS 2024 IEEE/ACM FIRST INTERNATIONAL CONFERENCE ON AI FOUNDATION MODELS AND SOFTWARE ENGINEERING, FORGE 2024, 2024, :52-56
[33]   People cannot distinguish GPT-4 from a human in a Turing test [J].
Jones, Cameron Robert ;
Rathi, Ishika ;
Taylor, Sydney ;
Bergen, Benjamin K. .
PROCEEDINGS OF THE 2025 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2025, 2025, :1615-1639
[34]   The potential of GPT-4 advanced data analysis for radiomics-based machine learning models [J].
Foltyn-Dumitru, Martha ;
Rastogi, Aditya ;
Cho, Jaeyoung ;
Schell, Marianne ;
Mahmutoglu, Mustafa Ahmed ;
Kessler, Tobias ;
Sahm, Felix ;
Wick, Wolfgang ;
Bendszus, Martin ;
Brugnara, Gianluca ;
Vollmuth, Philipp .
NEURO-ONCOLOGY ADVANCES, 2025, 7 (01)
[35]   Literary characters and GPT-4: from William Shakespeare to Elena Ferrante [J].
Abrams, Gabriel .
DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2025, 40 (01) :1-14
[36]   Mind meets machine: Unravelling GPT-4’s cognitive psychology [J].
Dhingra S. ;
Singh M. ;
S.B. V. ;
Malviya N. ;
Gill S.S. .
BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2023, 3 (03)
[37]   Comparative analysis of GPT-4, Gemini, and Ernie as gloss sign language translators in special education [J].
Achraf Othman ;
Khansa Chemnad ;
Ahmed Tlili ;
Ting Da ;
Huanhuan Wang ;
Ronghuai Huang .
Discover Global Society, 2 (1)
[38]   Re-evaluating GPT-4's bar exam performance [J].
Martinez, Eric .
ARTIFICIAL INTELLIGENCE AND LAW, 2024,
[39]   Diagnostic accuracy of GPT-4 on common clinical scenarios and challenging cases [J].
Rutledge, Geoffrey W. .
LEARNING HEALTH SYSTEMS, 2024, 8 (03)
[40]   Große Sprachmodelle wie ChatGPT und GPT-4 für eine patientenzentrierte RadiologieLarge language models such as ChatGPT and GPT-4 for patient-centered care in radiology [J].
Matthias A. Fink .
Die Radiologie, 2023, 63 :665-671