Prompting GPT-4 to support automatic safety case generation

被引：7

作者：

Sivakumar, Mithila ^{[1
]}

Belle, Alvine B. ^{[1
]}

Shan, Jinjun ^{[1
]}

Shahandashti, Kimya Khakzad ^{[1
]}

机构：

[1] York Univ, Lassonde Sch Engn, 4700 Keele St, Toronto, ON M3J 1P3, Canada

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 255卷

关键词：

Safety cases; Safety assurance; Machine learning; Large language models; Generative AI; Requirements engineering;

D O I：

10.1016/j.eswa.2024.124653

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the ever-evolving field of software engineering, the advent of large language models and conversational interfaces, exemplified by ChatGPT, represents a significant revolution. While their potential is evident in various domains, this paper expands upon our previous research, where we experimented with GPT -4, on its ability to create safety cases. A safety case is a structured argument supported by a body of evidence to demonstrate that a given system is safe to operate in a given environment. In this paper, we first determine GPT -4's comprehension of the Goal Structuring Notation (GSN), a well-established notation for visually representing safety cases. Additionally, we conduct four distinct experiments using GPT -4 to evaluate its ability to generate safety cases within a specified system and application domain. To assess GPT -4's performance in this context, we compare the results it produces with the ground-truth safety cases developed for an X-ray system, a machine learning-enabled component for tire noise recognition in a vehicle, and a lane management system from the automotive domain. This comparison enables us to gain valuable insights into the model's generative capabilities. Our findings indicate that GPT -4 is able to generate moderately accurate and reasonable safety cases.

引用

页数：18

共 50 条

[31] Using GPT-4 Turbo To Automatically Identify Defeaters In Assurance Cases [J].

Shahandashti, Kimya Khakzad ;

Belle, Alvine Boaye ;

Mohajer, Mohammad Mahdi ;

Odu, Oluwafemi ;

Lethbridge, Timothy C. ;

Hemmati, Hadi ;

Wang, Song .

32ND INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW 2024, 2024, :46-56

[32] Assessing the Impact of GPT-4 Turbo in Generating Defeaters for Assurance Cases [J].

Shahandashti, Kimya Khakzad ;

Sivakumar, Mithila ;

Mohajer, Mohammad Mahdi ;

Belle, Alvine B. ;

Wang, Song ;

Lethbridge, Timothy C. .

PROCEEDINGS 2024 IEEE/ACM FIRST INTERNATIONAL CONFERENCE ON AI FOUNDATION MODELS AND SOFTWARE ENGINEERING, FORGE 2024, 2024, :52-56

[33] People cannot distinguish GPT-4 from a human in a Turing test [J].

Jones, Cameron Robert ;

Rathi, Ishika ;

Taylor, Sydney ;

Bergen, Benjamin K. .

PROCEEDINGS OF THE 2025 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2025, 2025, :1615-1639

[34] The potential of GPT-4 advanced data analysis for radiomics-based machine learning models [J].

Foltyn-Dumitru, Martha ;

Rastogi, Aditya ;

Cho, Jaeyoung ;

Schell, Marianne ;

Mahmutoglu, Mustafa Ahmed ;

Kessler, Tobias ;

Sahm, Felix ;

Wick, Wolfgang ;

Bendszus, Martin ;

Brugnara, Gianluca ;

Vollmuth, Philipp .

NEURO-ONCOLOGY ADVANCES, 2025, 7 (01)

[35] Literary characters and GPT-4: from William Shakespeare to Elena Ferrante [J].

Abrams, Gabriel .

DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2025, 40 (01) :1-14

[36] Mind meets machine: Unravelling GPT-4’s cognitive psychology [J].

Dhingra S. ;

Singh M. ;

S.B. V. ;

Malviya N. ;

Gill S.S. .

BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2023, 3 (03)

[37] Comparative analysis of GPT-4, Gemini, and Ernie as gloss sign language translators in special education [J].

Achraf Othman ;

Khansa Chemnad ;

Ahmed Tlili ;

Ting Da ;

Huanhuan Wang ;

Ronghuai Huang .

Discover Global Society, 2 (1)

[38] Re-evaluating GPT-4's bar exam performance [J].

Martinez, Eric .

ARTIFICIAL INTELLIGENCE AND LAW, 2024,

[39] Diagnostic accuracy of GPT-4 on common clinical scenarios and challenging cases [J].

Rutledge, Geoffrey W. .

LEARNING HEALTH SYSTEMS, 2024, 8 (03)

[40] Große Sprachmodelle wie ChatGPT und GPT-4 für eine patientenzentrierte RadiologieLarge language models such as ChatGPT and GPT-4 for patient-centered care in radiology [J].

Matthias A. Fink .

Die Radiologie, 2023, 63 :665-671

← 1 2 3 4 5 →