The use of large language models for program repair

被引：0

作者：

Zubair, Fida ^{[1
]}

Al-Hitmi, Maryam ^{[1
]}

Catal, Cagatay ^{[1
]}

机构：

[1] Qatar Univ, Coll Engn, Dept Comp Sci & Engn, Doha 2713, Qatar

来源：

COMPUTER STANDARDS & INTERFACES | 2025年 / 93卷

关键词：

Large language model; Program repair; Software engineering; Automated program repair;

D O I：

10.1016/j.csi.2024.103951

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large Language Models (LLMs) have emerged as a promising approach for automated program repair, offering code comprehension and generation capabilities that can address software bugs. Several program repair models based on LLMs have been developed recently. However, findings and insights from these efforts are scattered across various studies, lacking a systematic overview of LLMs' utilization in program repair. Therefore, this Systematic Literature Review (SLR) was conducted to investigate the current landscape of LLM utilization in program repair. This study defined seven research questions and thoroughly selected 41 relevant studies from scientific databases to explore these questions. The results showed the diverse capabilities of LLMs for program repair. The findings revealed that Encoder-Decoder architectures emerged as the most common LLM design for program repair tasks and that mostly open-access datasets were used. Several evaluation metrics were applied, primarily consisting of accuracy, exact match, and BLEU scores. Additionally, the review investigated several LLM fine-tuning methods, including fine-tuning on specialized datasets, curriculum learning, iterative approaches, and knowledge-intensified techniques. These findings pave the way for further research on utilizing the full potential of LLMs to revolutionize automated program repair.

引用

页数：12

共 69 条

[41] Pearce H, 2022, Arxiv, DOI arXiv:2112.02125
[42] Can OpenAI's Codex Fix Bugs? An evaluation on QuixBugs
Prenner, Julian Aron
Babii, Hlib
Robbes, Romain
[J]. INTERNATIONAL WORKSHOP ON AUTOMATED PROGRAM REPAIR (APR 2022), 2022, : 69 - 75
[43] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
Raiaan, Mohaimenul Azam Khan
Mukta, Md. Saddam Hossain
Fatema, Kaniz
Fahad, Nur Mohammad
Sakib, Sadman
Mim, Most Marufatul Jannat
Ahmad, Jubaer
Ali, Mohammed Eunus
Azam, Sami
[J]. IEEE ACCESS, 2024, 12 : 26839 - 26874
[44] Large Language Models for Automated Program Repair
Ribeiro, Francisco
[J]. COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
[45] GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair
Ribeiro, Francisco
Castro de Macedo, Jose Nuno
Tsushima, Kanae
Abreu, Rui
Saraiva, Joao
[J]. PROCEEDINGS OF THE 16TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING, SLE 2023, 2023, : 111 - 124
[46] Bugs.jar: A Large-scale, Diverse Dataset of Real-world Java']Java Bugs
Saha, Ripon K.
Lyu, Yingjun
Lam, Wing
Yoshida, Hiroaki
Prasad, Mukul R.
[J]. 2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, : 10 - 13
[47] Shirafuji Atsushi, 2023, 2023 12th International Conference on Awareness Science and Technology (iCAST), P178, DOI 10.1109/iCAST57874.2023.10359288
[48] Application of machine learning to improve dairy farm management: A systematic literature review
Slob, Naftali
Catal, Cagatay
Kassahun, Ayalew
[J]. PREVENTIVE VETERINARY MEDICINE, 2021, 187
[49] Sobania D, 2023, Arxiv, DOI [arXiv:2301.08653, 10.48550/ARXIV.2301.08653]
[50] The Best of Both Worlds: Combining Learned Embeddings with Engineered Features for Accurate Prediction of Correct Patches
Tian, Haoye
Liu, Kui
Li, Yinghua
Kabore, Abdoul Kader
Koyuncu, Anil
Habib, Andrew
Li, Li
Wen, Junhao
Klein, Jacques
Bissyande, Tegawende F.
[J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)

← 1 2 3 4 5 6 7 →