Cost Measures Matter for Mutation Testing Study Validity

被引:7
作者
Guizzo, Giovani [1 ]
Sarro, Federica [1 ]
Harman, Mark [1 ]
机构
[1] UCL, Dept Comp Sci, London, England
来源
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20) | 2020年
基金
欧洲研究理事会;
关键词
Software Testing; Mutation Testing; Mutation Analysis; Cost Reduction; Number of Mutants; Execution Time; Mutant Reduction;
D O I
10.1145/3368089.3409742
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mutation testing research has often used the number of mutants as a surrogate measure for the true execution cost of generating and executing mutants. This poses a potential threat to the validity of the scientific findings reported in the literature. Out of 75 works surveyed in this paper, we found that 54 (72%) are vulnerable to this threat. To investigate the magnitude of the threat, we conducted an empirical evaluation using 10 real-world programs. The results reveal that: i) percentages of randomly sampled mutants differ from the true execution time, on average, by 44%, varying in difference from 19% to 91%; ii) errors arising from using the surrogate correlate with program size (rho = 0.74) and number of mutants (rho = 0.76), making the problem more pernicious for more realistic programs; iii) scientific findings concerning sampling strategies would have approximately 37% rank disagreement, indicating potentially dramatic impact on experiment validity. To investigate whether this threat matters in practice, we reproduced a seminal study on Selective Mutation (widely relied upon for more than two decades). The impact is stark: an inconclusive scientific finding using the surrogate is transformed to an unequivocal finding when using the true execution cost.
引用
收藏
页码:1127 / 1139
页数:13
相关论文
共 98 条
  • [1] Efficient Product-Line Testing using Cluster-Based Product Prioritization
    Al-Hajjaji, Mustafa
    Krueger, Jacob
    Schulze, Sandro
    Leich, Thomas
    Saake, Gunter
    [J]. 2017 IEEE/ACM 12TH INTERNATIONAL WORKSHOP ON AUTOMATION OF SOFTWARE TESTING (AST), 2017, : 16 - 22
  • [2] Establishing Theoretical Minimal Sets of Mutants
    Ammann, Paul
    Delamaro, Marcio E.
    Offutt, Jeff
    [J]. 2014 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2014, : 21 - 30
  • [3] [Anonymous], 2016, P 25 INT S SOFTWARE
  • [4] [Anonymous], 2013, P 9 JOINT M FDN SOFT
  • [5] A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering
    Arcuri, Andrea
    Briand, Lionel
    [J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2014, 24 (03) : 219 - 250
  • [6] Toward the determination of sufficient mutant operators for C
    Barbosa, EF
    Maldonado, JC
    Vincenzi, AMR
    [J]. SOFTWARE TESTING VERIFICATION & RELIABILITY, 2001, 11 (02) : 113 - 136
  • [7] Bluemke Ilona, 2014, Proceedings of the 9th International Conference on Software Engineering and Applications ICSOFT-EA 2014, P297
  • [8] Reductions of Operators in Java']Java Mutation Testing
    Bluemke, Ilona
    Kulesza, Karol
    [J]. PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON DEPENDABILITY AND COMPLEX SYSTEMS DEPCOS-RELCOMEX, 2014, 286 : 93 - 102
  • [9] Reduction of Computational Cost in Mutation Testing by Sampling Mutants
    Bluemke, Ilona
    Kulesza, Karol
    [J]. NEW RESULTS IN DEPENDABILITY AND COMPUTER SYSTEMS, 2013, 224 : 41 - 51
  • [10] 2 NOTIONS OF CORRECTNESS AND THEIR RELATION TO TESTING
    BUDD, TA
    ANGLUIN, D
    [J]. ACTA INFORMATICA, 1982, 18 (01) : 31 - 45