BUGSPHP: A dataset for Automated Program Repair in PHP

被引:0
|
作者
Pramod, K. D. [1 ]
De Silva, W. T. N. [1 ]
Thabrew, W. U. K. [1 ]
Shariffdeen, Ridwan [2 ]
Wickramanayake, Sandareka [1 ]
机构
[1] Univ Moratuwa, Moratuwa, Sri Lanka
[2] Natl Univ Singapore, Singapore, Singapore
来源
2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2024年
关键词
Automated Program Repair; PHP Application Errors;
D O I
10.1145/3643991.3644878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated Program Repair (APR) improves developer productivity by saving debugging and bug-fixing time. While APR has been extensively explored for C/C++ and Java programs, there is little research on bugs in PHP programs due to the lack of a benchmark PHP bug dataset. This is surprising given that PHP has been one of the most widely used server-side languages for over two decades, being used in a variety of contexts such as e-commerce, social networking, and content management. This paper presents a benchmark dataset of PHP bugs on real-world applications called BugsPHP, which can enable research on analysis, testing, and repair for PHP programs. The dataset consists of training and test datasets, separately curated from GitHub and processed locally. The training dataset includes more than 600,000 bug-fixing commits. The test dataset contains 513 manually validated bug-fixing commits equipped with developer-provided test cases to assess patch correctness.
引用
收藏
页码:128 / 132
页数:5
相关论文
共 50 条
  • [1] Is the Ground Truth Really Accurate? Dataset Purification for Automated Program Repair
    Yang, Deheng
    Lei, Yan
    Mao, Xiaoguang
    Lo, David
    Xie, Huan
    Yan, Meng
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2021), 2021, : 96 - 107
  • [2] Automatic dataset generation for automated program repair of bugs and vulnerabilities through SonarQube
    del-Hoyo-Gabaldon, Jesus -Angel
    Moreno-Cediel, Antonio
    Garcia-Lopez, Eva
    Garcia-Cabot, Antonio
    de-Fitero-Dominguez, David
    SOFTWAREX, 2024, 26
  • [3] The Impact of Program Reduction on Automated Program Repair
    Vidziunas, Linas
    Binkley, David
    Moonen, Leon
    2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION, ICSME 2024, 2024, : 337 - 349
  • [4] Advancements in automated program repair: a comprehensive review
    Dikici, Sena
    Bilgin, Turgay Tugay
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025,
  • [5] Large Language Models for Automated Program Repair
    Ribeiro, Francisco
    COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 7 - 9
  • [6] Adversarial patch generation for automated program repair
    Alhefdhi, Abdulaziz
    Dam, Hoa Khanh
    Le-Cong, Thanh
    Le, Bach
    Ghose, Aditya
    SOFTWARE QUALITY JOURNAL, 2025, 33 (01)
  • [7] An Assessment of the Quality of Automated Program Operator Repair
    Assiri, Fatmah Yousef
    Bieman, James M.
    2014 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2014, : 273 - 282
  • [8] Improving Automated Program Repair with Domain Adaptation
    Zirak, Armin
    Hemmati, Hadi
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (03)
  • [9] Toward Improving Graftability on Automated Program Repair
    Sumi, Soichi
    Higo, Yoshiki
    Hotta, Keisuke
    Kusumoto, Shinji
    2015 31ST INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME) PROCEEDINGS, 2015, : 511 - 515
  • [10] The Impact of Search Algorithms in Automated Program Repair
    Assiri, Fatmah Yousef
    Bieman, James M.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND SOFTWARE ENGINEERING (SCSE'15), 2015, 62 : 65 - 72