Large Language Models and Simple, Stupid Bugs

被引:11
|
作者
Jesse, Kevin [1 ]
Ahmed, Toufique [1 ]
Devanbu, Premkumar T. [1 ]
Morgan, Emily [1 ]
机构
[1] Univ Calif Davis, Davis, CA 95616 USA
来源
2023 IEEE/ACM 20TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2023年
基金
美国国家科学基金会;
关键词
language models; prompting; deep learning; software engineering;
D O I
10.1109/MSR59073.2023.00082
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the advent of powerful neural language models, AI-based systems to assist developers in coding tasks are becoming widely available; Copilot is one such system. Copilot uses Codex, a large language model (LLM), to complete code conditioned on a preceding "prompt". Codex, however, is trained on public GitHub repositories, viz., on code that may include bugs and vulnerabilities. Previous studies [1], [2] show Codex reproduces vulnerabilities seen in training. In this study, we examine how prone Codex is to generate an interesting bug category, single statement bugs, commonly referred to as simple, stupid bugs or SStuBs in the MSR community. We find that Codex and similar LLMs do help avoid some SStuBs, but do produce known, verbatim SStuBs as much as 2x as likely than known, verbatim correct code. We explore the consequences of the Codex generated SStuBs and propose avoidance strategies that suggest the possibility of reducing the production of known, verbatim SStubs, and increase the possibility of producing known, verbatim fixes.
引用
收藏
页码:563 / 575
页数:13
相关论文
共 50 条
  • [21] Exploring Large Language Models in a Limited Resource Scenario
    Panchbhai, Anand
    Pankanti, Smarana
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 147 - 152
  • [22] Scrutinizing the foundations: could large language models be solipsistic?
    Esanu, Andreea
    SYNTHESE, 2024, 203 (05)
  • [23] Biases in Large Language Models: Origins, Inventory, and Discussion
    Navigli, Roberto
    Conia, Simone
    Ross, Bjorn
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2023, 15 (02):
  • [24] The promises of large language models for protein design and modeling
    Valentini, Giorgio
    Malchiodi, Dario
    Gliozzo, Jessica
    Mesiti, Marco
    Soto-Gomez, Mauricio
    Cabri, Alberto
    Reese, Justin
    Casiraghi, Elena
    Robinson, Peter N.
    FRONTIERS IN BIOINFORMATICS, 2023, 3
  • [25] Large Language Models in Oncology: Revolution or Cause for Concern?
    Caglayan, Aydin
    Slusarczyk, Wojciech
    Rabbani, Rukhshana Dina
    Ghose, Aruni
    Papadopoulos, Vasileios
    Boussios, Stergios
    CURRENT ONCOLOGY, 2024, 31 (04) : 1817 - 1830
  • [26] Prompting Large Language Models to Power Educational Chatbots
    Farah, Juan Carlos
    Ingram, Sandy
    Spaenlehauer, Basile
    Lasne, Fanny Kim-Lan
    Gillet, Denis
    ADVANCES IN WEB-BASED LEARNING, ICWL 2023, 2023, 14409 : 169 - 188
  • [27] Design pattern recognition: a study of large language models
    Pandey, Sushant Kumar
    Chand, Sivajeet
    Horkoff, Jennifer
    Staron, Miroslaw
    Ochodek, Miroslaw
    Durisic, Darko
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [28] Multimodal Food Image Classification with Large Language Models
    Kim, Jun-Hwa
    Kim, Nam-Ho
    Jo, Donghyeok
    Won, Chee Sun
    ELECTRONICS, 2024, 13 (22)
  • [29] Applicability of large language models and generative models for legal case judgement summarization
    Deroy, Aniket
    Ghosh, Kripabandhu
    Ghosh, Saptarshi
    ARTIFICIAL INTELLIGENCE AND LAW, 2024,
  • [30] "No Free Lunch" when using Large Language Models to Verify Self-Generated Programs
    Zilberman, Sol
    Cheng, Betty H. C.
    2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS, ICSTW 2024, 2024, : 29 - 36