Cost Measures Matter for Mutation Testing Study Validity

被引：7

作者：

Guizzo, Giovani ^{[1
]}

Sarro, Federica ^{[1
]}

Harman, Mark ^{[1
]}

机构：

[1] UCL, Dept Comp Sci, London, England

来源：

PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20) | 2020年

基金：

欧洲研究理事会;

关键词：

Software Testing; Mutation Testing; Mutation Analysis; Cost Reduction; Number of Mutants; Execution Time; Mutant Reduction;

D O I：

10.1145/3368089.3409742

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Mutation testing research has often used the number of mutants as a surrogate measure for the true execution cost of generating and executing mutants. This poses a potential threat to the validity of the scientific findings reported in the literature. Out of 75 works surveyed in this paper, we found that 54 (72%) are vulnerable to this threat. To investigate the magnitude of the threat, we conducted an empirical evaluation using 10 real-world programs. The results reveal that: i) percentages of randomly sampled mutants differ from the true execution time, on average, by 44%, varying in difference from 19% to 91%; ii) errors arising from using the surrogate correlate with program size (rho = 0.74) and number of mutants (rho = 0.76), making the problem more pernicious for more realistic programs; iii) scientific findings concerning sampling strategies would have approximately 37% rank disagreement, indicating potentially dramatic impact on experiment validity. To investigate whether this threat matters in practice, we reproduced a seminal study on Selective Mutation (widely relied upon for more than two decades). The impact is stark: an inconclusive scientific finding using the surrogate is transformed to an unequivocal finding when using the true execution cost.

引用

页码：1127 / 1139

页数：13

共 98 条

[1] Efficient Product-Line Testing using Cluster-Based Product Prioritization
Al-Hajjaji, Mustafa
Krueger, Jacob
Schulze, Sandro
Leich, Thomas
Saake, Gunter
[J]. 2017 IEEE/ACM 12TH INTERNATIONAL WORKSHOP ON AUTOMATION OF SOFTWARE TESTING (AST), 2017, : 16 - 22
[2] Establishing Theoretical Minimal Sets of Mutants
Ammann, Paul
Delamaro, Marcio E.
Offutt, Jeff
[J]. 2014 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2014, : 21 - 30
[3] [Anonymous], 2016, P 25 INT S SOFTWARE
[4] [Anonymous], 2013, P 9 JOINT M FDN SOFT
[5] A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering
Arcuri, Andrea
Briand, Lionel
[J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2014, 24 (03) : 219 - 250
[6] Toward the determination of sufficient mutant operators for C
Barbosa, EF
Maldonado, JC
Vincenzi, AMR
[J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2001, 11 (02) : 113 - 136
[7] Bluemke Ilona, 2014, Proceedings of the 9th International Conference on Software Engineering and Applications ICSOFT-EA 2014, P297
[8] Reductions of Operators in Java']Java Mutation Testing
Bluemke, Ilona
Kulesza, Karol
[J]. PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON DEPENDABILITY AND COMPLEX SYSTEMS DEPCOS-RELCOMEX, 2014, 286 : 93 - 102
[9] Reduction of Computational Cost in Mutation Testing by Sampling Mutants
Bluemke, Ilona
Kulesza, Karol
[J]. NEW RESULTS IN DEPENDABILITY AND COMPUTER SYSTEMS, 2013, 224 : 41 - 51
[10] 2 NOTIONS OF CORRECTNESS AND THEIR RELATION TO TESTING
BUDD, TA
ANGLUIN, D
[J]. ACTA INFORMATICA, 1982, 18 (01) : 31 - 45

← 1 2 3 4 5 6 7 8 9 10 →