Large Language Models and Simple, Stupid Bugs

被引：11

作者：

Jesse, Kevin ^{[1
]}

Ahmed, Toufique ^{[1
]}

Devanbu, Premkumar T. ^{[1
]}

Morgan, Emily ^{[1
]}

机构：

[1] Univ Calif Davis, Davis, CA 95616 USA

来源：

2023 IEEE/ACM 20TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2023年

基金：

美国国家科学基金会;

关键词：

language models; prompting; deep learning; software engineering;

D O I：

10.1109/MSR59073.2023.00082

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

With the advent of powerful neural language models, AI-based systems to assist developers in coding tasks are becoming widely available; Copilot is one such system. Copilot uses Codex, a large language model (LLM), to complete code conditioned on a preceding "prompt". Codex, however, is trained on public GitHub repositories, viz., on code that may include bugs and vulnerabilities. Previous studies [1], [2] show Codex reproduces vulnerabilities seen in training. In this study, we examine how prone Codex is to generate an interesting bug category, single statement bugs, commonly referred to as simple, stupid bugs or SStuBs in the MSR community. We find that Codex and similar LLMs do help avoid some SStuBs, but do produce known, verbatim SStuBs as much as 2x as likely than known, verbatim correct code. We explore the consequences of the Codex generated SStuBs and propose avoidance strategies that suggest the possibility of reducing the production of known, verbatim SStubs, and increase the possibility of producing known, verbatim fixes.

引用

页码：563 / 575

页数：13

共 50 条

[21] Exploring Large Language Models in a Limited Resource Scenario
Panchbhai, Anand
Pankanti, Smarana
2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 147 - 152
[22] Scrutinizing the foundations: could large language models be solipsistic?
Esanu, Andreea
SYNTHESE, 2024, 203 (05)
[23] Biases in Large Language Models: Origins, Inventory, and Discussion
Navigli, Roberto
Conia, Simone
Ross, Bjorn
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2023, 15 (02):
[24] The promises of large language models for protein design and modeling
Valentini, Giorgio
Malchiodi, Dario
Gliozzo, Jessica
Mesiti, Marco
Soto-Gomez, Mauricio
Cabri, Alberto
Reese, Justin
Casiraghi, Elena
Robinson, Peter N.
FRONTIERS IN BIOINFORMATICS, 2023, 3
[25] Large Language Models in Oncology: Revolution or Cause for Concern?
Caglayan, Aydin
Slusarczyk, Wojciech
Rabbani, Rukhshana Dina
Ghose, Aruni
Papadopoulos, Vasileios
Boussios, Stergios
CURRENT ONCOLOGY, 2024, 31 (04) : 1817 - 1830
[26] Prompting Large Language Models to Power Educational Chatbots
Farah, Juan Carlos
Ingram, Sandy
Spaenlehauer, Basile
Lasne, Fanny Kim-Lan
Gillet, Denis
ADVANCES IN WEB-BASED LEARNING, ICWL 2023, 2023, 14409 : 169 - 188
[27] Design pattern recognition: a study of large language models
Pandey, Sushant Kumar
Chand, Sivajeet
Horkoff, Jennifer
Staron, Miroslaw
Ochodek, Miroslaw
Durisic, Darko
EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
[28] Multimodal Food Image Classification with Large Language Models
Kim, Jun-Hwa
Kim, Nam-Ho
Jo, Donghyeok
Won, Chee Sun
ELECTRONICS, 2024, 13 (22)
[29] Applicability of large language models and generative models for legal case judgement summarization
Deroy, Aniket
Ghosh, Kripabandhu
Ghosh, Saptarshi
ARTIFICIAL INTELLIGENCE AND LAW, 2024,
[30] "No Free Lunch" when using Large Language Models to Verify Self-Generated Programs
Zilberman, Sol
Cheng, Betty H. C.
2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS, ICSTW 2024, 2024, : 29 - 36

← 1 2 3 4 5 →