Codeflaws: A Programming Competition Benchmark for Evaluating Automated Program Repair Tools

被引:92
|
作者
Tan, Shin Hwei [1 ]
Yi, Jooyong [2 ]
Yulis [1 ]
Mechtaev, Sergey [1 ]
Roychoudhury, Abhik [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Innopolis Univ, Innopolis, Russia
来源
PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017) | 2017年
关键词
automated program repair; defect classes; empirical evaluation; benchmark;
D O I
10.1109/ICSE-C.2017.76
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Several automated program repair techniques have been proposed to reduce the time and effort spent in bug-fixing. While these repair tools are designed to be generic such that they could address many software faults, different repair tools may fix certain types of faults more effectively than other tools. Therefore, it is important to compare more objectively the effectiveness of different repair tools on various fault types. However, existing benchmarks on automated program repairs do not allow thorough investigation of the relationship between fault types and the effectiveness of repair tools. We present Codeflaws, a set of 3902 defects from 7436 programs automatically classified across 39 defect classes (we refer to different types of fault as defect classes derived from the syntactic differences between a buggy program and a patched program).
引用
收藏
页码:180 / 182
页数:3
相关论文
共 50 条
  • [31] Do automated program repair techniques repair hard and important bugs?
    Manish Motwani
    Sandhya Sankaranarayanan
    René Just
    Yuriy Brun
    Empirical Software Engineering, 2018, 23 : 2901 - 2947
  • [32] Do automated program repair techniques repair hard and important bugs?
    Motwani, Manish
    Sankaranarayanan, Sandhya
    Just, Rene
    Brun, Yuriy
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 2901 - 2947
  • [33] Improved Crossover Operators for Genetic Programming for Program Repair
    Oliveira, Vinicius Paulo L.
    Souza, Eduardo F. D.
    Le Goues, Claire
    Camilo-Junior, Celso G.
    SEARCH BASED SOFTWARE ENGINEERING, SSBSE 2016, 2016, 9962 : 112 - 127
  • [34] A correlation study between automated program repair and test-suite metrics
    Yi, Jooyong
    Tan, Shin Hwei
    Mechtaev, Sergey
    Bohme, Marcel
    Roychoudhury, Abhik
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 2948 - 2979
  • [35] A correlation study between automated program repair and test-suite metrics
    Jooyong Yi
    Shin Hwei Tan
    Sergey Mechtaev
    Marcel Böhme
    Abhik Roychoudhury
    Empirical Software Engineering, 2018, 23 : 2948 - 2979
  • [36] Proposal of Multimodal Program Optimization Benchmark and Its Application to Multimodal Genetic Programming
    Harada, Tomohiro
    Murano, Kei
    Thawonmas, Ruck
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [37] A critical review on the evaluation of automated program repair systems
    Liu, Kui
    Li, Li
    Koyuncu, Anil
    Kim, Dongsun
    Liu, Zhe
    Klein, Jacques
    Bissyande, Tegawende F.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 171
  • [38] Automatic Software Merging using Automated Program Repair
    Xing, Xiaoqian
    Maruyama, Katsuhisa
    2019 IEEE 1ST INTERNATIONAL WORKSHOP ON INTELLIGENT BUG FIXING (IBF '19), 2019, : 11 - 16
  • [39] ThinkRepair: Self-Directed Automated Program Repair
    Yin, Xin
    Ni, Chao
    Wang, Shaohua
    Li, Zhenhao
    Zeng, Limin
    Yang, Xiaohu
    PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 1274 - 1286
  • [40] Overfitting in semantics-based automated program repair
    Xuan Bach D. Le
    Ferdian Thung
    David Lo
    Claire Le Goues
    Empirical Software Engineering, 2018, 23 : 3007 - 3033