Prompting GPT-4 to support automatic safety case generation

被引：7

作者：

Sivakumar, Mithila ^{[1
]}

Belle, Alvine B. ^{[1
]}

Shan, Jinjun ^{[1
]}

Shahandashti, Kimya Khakzad ^{[1
]}

机构：

[1] York Univ, Lassonde Sch Engn, 4700 Keele St, Toronto, ON M3J 1P3, Canada

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 255卷

关键词：

Safety cases; Safety assurance; Machine learning; Large language models; Generative AI; Requirements engineering;

D O I：

10.1016/j.eswa.2024.124653

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the ever-evolving field of software engineering, the advent of large language models and conversational interfaces, exemplified by ChatGPT, represents a significant revolution. While their potential is evident in various domains, this paper expands upon our previous research, where we experimented with GPT -4, on its ability to create safety cases. A safety case is a structured argument supported by a body of evidence to demonstrate that a given system is safe to operate in a given environment. In this paper, we first determine GPT -4's comprehension of the Goal Structuring Notation (GSN), a well-established notation for visually representing safety cases. Additionally, we conduct four distinct experiments using GPT -4 to evaluate its ability to generate safety cases within a specified system and application domain. To assess GPT -4's performance in this context, we compare the results it produces with the ground-truth safety cases developed for an X-ray system, a machine learning-enabled component for tire noise recognition in a vehicle, and a lane management system from the automotive domain. This comparison enables us to gain valuable insights into the model's generative capabilities. Our findings indicate that GPT -4 is able to generate moderately accurate and reasonable safety cases.

引用

页数：18

共 50 条

[41] Utilizing GPT-4 and generative artificial intelligence platforms for surgical education: an experimental study on skin ulcers [J].

Seth, Ishith ;

Lim, Bryan ;

Cevik, Jevan ;

Sofiadellis, Foti ;

Ross, Richard J. ;

Cuomo, Roberto ;

Rozen, Warren M. .

EUROPEAN JOURNAL OF PLASTIC SURGERY, 2024, 47 (01)

[42] When vision meets reality: Exploring the clinical applicability of GPT-4 with vision [J].

Deng, Jiawen ;

Heybati, Kiyan ;

Shammas-Toma, Matthew .

CLINICAL IMAGING, 2024, 108

[43] Reducing emotional bias in investment decisions: the role of GPT-4 in financial analysis [J].

Hsu, Wen-Chin ;

Wang, Ming-Chun ;

Ting, Hsiu-, I .

ASIA-PACIFIC JOURNAL OF BUSINESS ADMINISTRATION, 2025,

[44] Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation [J].

Phung, Tung ;

Padurean, Victor-Alexandru ;

Singh, Anjali ;

Brooks, Christopher ;

Cambronero, Jose ;

Gulwani, Sumit .

FOURTEENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, LAK 2024, 2024, :12-23

[45] An Empirical Evaluation of the GPT-4 Multimodal Language Model on Visualization Literacy Tasks [J].

Bendeck, Alexander ;

Stasko, John .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (01) :1105-1115

[46] Effects of Different Prompts on the Quality of GPT-4 Responses to Dementia Care Questions [J].

Li, Zhuochun ;

Xief, Bo ;

Hilsabeck, Robin ;

Aguirre, Alyssa ;

Zou, Ning ;

Luo, Zhimeng ;

He, Daqing .

2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, :412-417

[47] Evaluating the Utility of OpenAI's GPT-4 as a Diagnostic and Management Aid in Medicine [J].

Ge, Alan ;

Pandya, Vidish ;

Ferrick, Kevin J. ;

Krumerman, Andrew .

CIRCULATION, 2023, 148

[48] Evaluating GPT-4's Ability to Identify Additional Context<bold> </bold> [J].

Armstrong, Victoria ;

Muise, Christian .

2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, :180-181

[49] Assessing the accuracy of the GPT-4 model in multidisciplinary tumor board decision prediction [J].

Erdat, Efe Cem ;

Yalciner, Merih ;

Oruncu, Mehmet Berk ;

Urun, Yuksel ;

Senler, Filiz cay .

CLINICAL & TRANSLATIONAL ONCOLOGY, 2025, 27 (09) :3793-3802

[50] Integrating AI in Lipedema Management: Assessing the Efficacy of GPT-4 as a Consultation Assistant [J].

Leypold, Tim ;

Lingens, Lara F. ;

Beier, Justus P. ;

Boos, Anja M. .

LIFE-BASEL, 2024, 14 (05)

← 1 2 3 4 5 →