BUGSPHP: A dataset for Automated Program Repair in PHP

被引:0
|
作者
Pramod, K. D. [1 ]
De Silva, W. T. N. [1 ]
Thabrew, W. U. K. [1 ]
Shariffdeen, Ridwan [2 ]
Wickramanayake, Sandareka [1 ]
机构
[1] Univ Moratuwa, Moratuwa, Sri Lanka
[2] Natl Univ Singapore, Singapore, Singapore
来源
2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR | 2024年
关键词
Automated Program Repair; PHP Application Errors;
D O I
10.1145/3643991.3644878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated Program Repair (APR) improves developer productivity by saving debugging and bug-fixing time. While APR has been extensively explored for C/C++ and Java programs, there is little research on bugs in PHP programs due to the lack of a benchmark PHP bug dataset. This is surprising given that PHP has been one of the most widely used server-side languages for over two decades, being used in a variety of contexts such as e-commerce, social networking, and content management. This paper presents a benchmark dataset of PHP bugs on real-world applications called BugsPHP, which can enable research on analysis, testing, and repair for PHP programs. The dataset consists of training and test datasets, separately curated from GitHub and processed locally. The training dataset includes more than 600,000 bug-fixing commits. The test dataset contains 513 manually validated bug-fixing commits equipped with developer-provided test cases to assess patch correctness.
引用
收藏
页码:128 / 132
页数:5
相关论文
共 50 条
  • [21] ThinkRepair: Self-Directed Automated Program Repair
    Yin, Xin
    Ni, Chao
    Wang, Shaohua
    Li, Zhenhao
    Zeng, Limin
    Yang, Xiaohu
    PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 1274 - 1286
  • [22] Overfitting in semantics-based automated program repair
    Xuan Bach D. Le
    Ferdian Thung
    David Lo
    Claire Le Goues
    Empirical Software Engineering, 2018, 23 : 3007 - 3033
  • [23] Impact of Code Language Models on Automated Program Repair
    Jiang, Nan
    Liu, Kevin
    Lutellier, Thibaud
    Tan, Lin
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 1430 - 1442
  • [24] Be Realistic: Automated Program Repair is a Combination of Undecidable Problems
    Nilizadeh, Amirfarhad
    Leavens, Gary T.
    INTERNATIONAL WORKSHOP ON AUTOMATED PROGRAM REPAIR (APR 2022), 2022, : 31 - 32
  • [25] An Investigation into the Use of Mutation Analysis for Automated Program Repair
    Timperley, Christopher Steven
    Stepney, Susan
    Le Goues, Claire
    SEARCH BASED SOFTWARE ENGINEERING, SSBSE 2017, 2017, 10452 : 99 - 114
  • [26] Evaluating the Strategies of Statement Selection in Automated Program Repair
    Yang, Deheng
    Qi, Yuhua
    Mao, Xiaoguang
    SOFTWARE ANALYSIS, TESTING, AND EVOLUTION, SATE 2018, 2018, 11293 : 33 - 48
  • [27] Automated Large Program Repair based on Big Code
    Hoang Van Thuy
    Phan Viet Anh
    Nguyen Xuan Hoai
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2018), 2018, : 375 - 381
  • [28] Applying Automated Program Repair to Dataflow Programming Languages
    Huang, Yu
    Ahmad, Hammad
    Forrest, Stephanie
    Weimer, Westley
    2021 IEEE/ACM INTERNATIONAL WORKSHOP ON GENETIC IMPROVEMENT (GI 2021), 2021, : 21 - 22
  • [29] Automated program repair: a step towards software automation
    Abhik ROYCHOUDHURY
    Yingfei XIONG
    ScienceChina(InformationSciences), 2019, 62 (10) : 47 - 49
  • [30] Overfitting in semantics-based automated program repair
    Le, Xuan Bach D.
    Thung, Ferdian
    Lo, David
    Le Goues, Claire
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 3007 - 3033