The use of large language models for program repair

被引:0
作者
Zubair, Fida [1 ]
Al-Hitmi, Maryam [1 ]
Catal, Cagatay [1 ]
机构
[1] Qatar Univ, Coll Engn, Dept Comp Sci & Engn, Doha 2713, Qatar
关键词
Large language model; Program repair; Software engineering; Automated program repair;
D O I
10.1016/j.csi.2024.103951
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) have emerged as a promising approach for automated program repair, offering code comprehension and generation capabilities that can address software bugs. Several program repair models based on LLMs have been developed recently. However, findings and insights from these efforts are scattered across various studies, lacking a systematic overview of LLMs' utilization in program repair. Therefore, this Systematic Literature Review (SLR) was conducted to investigate the current landscape of LLM utilization in program repair. This study defined seven research questions and thoroughly selected 41 relevant studies from scientific databases to explore these questions. The results showed the diverse capabilities of LLMs for program repair. The findings revealed that Encoder-Decoder architectures emerged as the most common LLM design for program repair tasks and that mostly open-access datasets were used. Several evaluation metrics were applied, primarily consisting of accuracy, exact match, and BLEU scores. Additionally, the review investigated several LLM fine-tuning methods, including fine-tuning on specialized datasets, curriculum learning, iterative approaches, and knowledge-intensified techniques. These findings pave the way for further research on utilizing the full potential of LLMs to revolutionize automated program repair.
引用
收藏
页数:12
相关论文
共 69 条
  • [41] Pearce H, 2022, Arxiv, DOI arXiv:2112.02125
  • [42] Can OpenAI's Codex Fix Bugs? An evaluation on QuixBugs
    Prenner, Julian Aron
    Babii, Hlib
    Robbes, Romain
    [J]. INTERNATIONAL WORKSHOP ON AUTOMATED PROGRAM REPAIR (APR 2022), 2022, : 69 - 75
  • [43] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
    Raiaan, Mohaimenul Azam Khan
    Mukta, Md. Saddam Hossain
    Fatema, Kaniz
    Fahad, Nur Mohammad
    Sakib, Sadman
    Mim, Most Marufatul Jannat
    Ahmad, Jubaer
    Ali, Mohammed Eunus
    Azam, Sami
    [J]. IEEE ACCESS, 2024, 12 : 26839 - 26874
  • [44] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    [J]. COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
  • [45] GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair
    Ribeiro, Francisco
    Castro de Macedo, Jose Nuno
    Tsushima, Kanae
    Abreu, Rui
    Saraiva, Joao
    [J]. PROCEEDINGS OF THE 16TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING, SLE 2023, 2023, : 111 - 124
  • [46] Bugs.jar: A Large-scale, Diverse Dataset of Real-world Java']Java Bugs
    Saha, Ripon K.
    Lyu, Yingjun
    Lam, Wing
    Yoshida, Hiroaki
    Prasad, Mukul R.
    [J]. 2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, : 10 - 13
  • [47] Shirafuji Atsushi, 2023, 2023 12th International Conference on Awareness Science and Technology (iCAST), P178, DOI 10.1109/iCAST57874.2023.10359288
  • [48] Application of machine learning to improve dairy farm management: A systematic literature review
    Slob, Naftali
    Catal, Cagatay
    Kassahun, Ayalew
    [J]. PREVENTIVE VETERINARY MEDICINE, 2021, 187
  • [49] Sobania D, 2023, Arxiv, DOI [arXiv:2301.08653, 10.48550/ARXIV.2301.08653]
  • [50] The Best of Both Worlds: Combining Learned Embeddings with Engineered Features for Accurate Prediction of Correct Patches
    Tian, Haoye
    Liu, Kui
    Li, Yinghua
    Kabore, Abdoul Kader
    Koyuncu, Anil
    Habib, Andrew
    Li, Li
    Wen, Junhao
    Klein, Jacques
    Bissyande, Tegawende F.
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)