Prompting GPT-4 to support automatic safety case generation

Cited by: 7
Authors
Sivakumar, Mithila [1]
Belle, Alvine B. [1]
Shan, Jinjun [1]
Shahandashti, Kimya Khakzad [1]
Affiliations
[1] York Univ, Lassonde Sch Engn, 4700 Keele St, Toronto, ON M3J 1P3, Canada
Keywords
Safety cases; Safety assurance; Machine learning; Large language models; Generative AI; Requirements engineering
DOI
10.1016/j.eswa.2024.124653
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the ever-evolving field of software engineering, the advent of large language models and conversational interfaces, exemplified by ChatGPT, represents a significant revolution. While their potential is evident in various domains, this paper expands upon our previous research, where we experimented with GPT-4, on its ability to create safety cases. A safety case is a structured argument supported by a body of evidence to demonstrate that a given system is safe to operate in a given environment. In this paper, we first determine GPT-4's comprehension of the Goal Structuring Notation (GSN), a well-established notation for visually representing safety cases. Additionally, we conduct four distinct experiments using GPT-4 to evaluate its ability to generate safety cases within a specified system and application domain. To assess GPT-4's performance in this context, we compare the results it produces with the ground-truth safety cases developed for an X-ray system, a machine learning-enabled component for tire noise recognition in a vehicle, and a lane management system from the automotive domain. This comparison enables us to gain valuable insights into the model's generative capabilities. Our findings indicate that GPT-4 is able to generate moderately accurate and reasonable safety cases.
Pages: 18