BugsInPy: A Database of Existing Bugs in Python']Python Programs to Enable Controlled Testing and Debugging Studies

被引:64
作者
Widyasari, Ratnadira [1 ]
Sim, Sheng Qin [1 ]
Lok, Camellia [1 ]
Qi, Haodi [1 ]
Phan, Jack [1 ]
Tay, Qijin [1 ]
Tan, Constance [1 ]
Wee, Fiona [1 ]
Tan, Jodie Ethelda [1 ]
Yieh, Yuheng [1 ]
Goh, Brian [1 ]
Thung, Ferdian [1 ]
Kang, Hong Jin [1 ]
Hoang, Thong [1 ]
Lo, David [1 ]
Ouh, Eng Lieh [1 ]
机构
[1] Singapore Management Univ, Singapore, Singapore
来源
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20) | 2020年
关键词
Bug Database; !text type='Python']Python[!/text; Testing and Debugging;
D O I
10.1145/3368089.3417943
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the abundance of testing and debugging tools for Java. Thus, there is a need to push research on tools that can help Python developers. One factor that contributed to the rapid growth of Java testing and debugging tools is the availability of benchmarks. A popular benchmark is the Defects4J benchmark; its initial version contained 357 real bugs from 5 real-world Java programs. Each bug comes with a test suite that can expose the bug. Defects4J has been used by hundreds of testing and debugging studies and has helped to push the frontier of research in these directions. In this project, inspired by Defects4J, we create another benchmark database and tool that contain 493 real bugs from 17 real-world Python programs. We hope our benchmark can help catalyze future work on testing and debugging tools that work on Python programs.
引用
收藏
页码:1556 / 1560
页数:5
相关论文
共 18 条
[1]   Defexts: A Curated Dataset of Reproducible Real-World Bugs for Modern JVM Languages [J].
Benton, Samuel ;
Ghanbari, Ali ;
Zhang, Lingming .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2019), 2019, :47-50
[2]   Supporting controlled experimentation with testing techniques: An infrastructure and its potential impact [J].
Do, HS ;
Elbaum, S ;
Rothermel, G .
EMPIRICAL SOFTWARE ENGINEERING, 2005, 10 (04) :405-435
[3]  
Durieux Thomas, 2019, ARXIV PREPRINT ARXIV
[4]   BUGSJS']JS: A Benchmark of Java']JavaScript Bugs [J].
Gyimesi, Peter ;
Vancsics, Bela ;
Stocco, Andrea ;
Mazinanian, Davood ;
Beszedes, Arpad ;
Ferenc, Rudolf ;
Mesbah, Ali .
2019 IEEE 12TH CONFERENCE ON SOFTWARE TESTING, VALIDATION AND VERIFICATION (ICST 2019), 2019, :90-101
[5]  
HUTCHINS M, 1994, PROC INT CONF SOFTW, P191, DOI 10.1109/ICSE.1994.296778
[6]  
JUST R, 2014, P 2014 INT S SOFTW T, P437, DOI DOI 10.1145/2610384.2628055
[7]   Are Mutants a Valid Substitute for Real Faults in Software Testing? [J].
Just, Rene ;
Jalali, Darioush ;
Inozemtseva, Laura ;
Ernst, Michael D. ;
Holmes, Reid ;
Fraser, Gordon .
22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, :654-665
[8]  
Le Tien-Duy B, 2016, P 25 INT S SOFTW TES, P177, DOI [10.1145/2931037.2931049, DOI 10.1145/2931037.2931049]
[9]   History Driven Program Repair [J].
Le, Xuan-Bach D. ;
Lo, David ;
Le Goues, Claire .
2016 IEEE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), VOL 1, 2016, :213-224
[10]  
Lin D, 2017, PROCEEDINGS COMPANION OF THE 2017 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2017, P55, DOI 10.1145/3135932.3135941