Codeflaws: A Programming Competition Benchmark for Evaluating Automated Program Repair Tools

被引：92

作者：

Tan, Shin Hwei ^{[1
]}

Yi, Jooyong ^{[2
]}

Yulis ^{[1
]}

Mechtaev, Sergey ^{[1
]}

Roychoudhury, Abhik ^{[1
]}

机构：

[1] Natl Univ Singapore, Singapore, Singapore

[2] Innopolis Univ, Innopolis, Russia

来源：

PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017) | 2017年

关键词：

automated program repair; defect classes; empirical evaluation; benchmark;

D O I：

10.1109/ICSE-C.2017.76

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Several automated program repair techniques have been proposed to reduce the time and effort spent in bug-fixing. While these repair tools are designed to be generic such that they could address many software faults, different repair tools may fix certain types of faults more effectively than other tools. Therefore, it is important to compare more objectively the effectiveness of different repair tools on various fault types. However, existing benchmarks on automated program repairs do not allow thorough investigation of the relationship between fault types and the effectiveness of repair tools. We present Codeflaws, a set of 3902 defects from 7436 programs automatically classified across 39 defect classes (we refer to different types of fault as defect classes derived from the syntactic differences between a buggy program and a patched program).

引用

页码：180 / 182

页数：3

共 50 条

[31] Do automated program repair techniques repair hard and important bugs?
Manish Motwani
Sandhya Sankaranarayanan
René Just
Yuriy Brun
Empirical Software Engineering, 2018, 23 : 2901 - 2947
[32] Do automated program repair techniques repair hard and important bugs?
Motwani, Manish
Sankaranarayanan, Sandhya
Just, Rene
Brun, Yuriy
EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 2901 - 2947
[33] Improved Crossover Operators for Genetic Programming for Program Repair
Oliveira, Vinicius Paulo L.
Souza, Eduardo F. D.
Le Goues, Claire
Camilo-Junior, Celso G.
SEARCH BASED SOFTWARE ENGINEERING, SSBSE 2016, 2016, 9962 : 112 - 127
[34] A correlation study between automated program repair and test-suite metrics
Yi, Jooyong
Tan, Shin Hwei
Mechtaev, Sergey
Bohme, Marcel
Roychoudhury, Abhik
EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 2948 - 2979
[35] A correlation study between automated program repair and test-suite metrics
Jooyong Yi
Shin Hwei Tan
Sergey Mechtaev
Marcel Böhme
Abhik Roychoudhury
Empirical Software Engineering, 2018, 23 : 2948 - 2979
[36] Proposal of Multimodal Program Optimization Benchmark and Its Application to Multimodal Genetic Programming
Harada, Tomohiro
Murano, Kei
Thawonmas, Ruck
2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
[37] A critical review on the evaluation of automated program repair systems
Liu, Kui
Li, Li
Koyuncu, Anil
Kim, Dongsun
Liu, Zhe
Klein, Jacques
Bissyande, Tegawende F.
JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 171
[38] Automatic Software Merging using Automated Program Repair
Xing, Xiaoqian
Maruyama, Katsuhisa
2019 IEEE 1ST INTERNATIONAL WORKSHOP ON INTELLIGENT BUG FIXING (IBF '19), 2019, : 11 - 16
[39] ThinkRepair: Self-Directed Automated Program Repair
Yin, Xin
Ni, Chao
Wang, Shaohua
Li, Zhenhao
Zeng, Limin
Yang, Xiaohu
PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 1274 - 1286
[40] Overfitting in semantics-based automated program repair
Xuan Bach D. Le
Ferdian Thung
David Lo
Claire Le Goues
Empirical Software Engineering, 2018, 23 : 3007 - 3033

← 1 2 3 4 5 →