Cost Measures Matter for Mutation Testing Study Validity

被引:7
作者
Guizzo, Giovani [1 ]
Sarro, Federica [1 ]
Harman, Mark [1 ]
机构
[1] UCL, Dept Comp Sci, London, England
来源
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20) | 2020年
基金
欧洲研究理事会;
关键词
Software Testing; Mutation Testing; Mutation Analysis; Cost Reduction; Number of Mutants; Execution Time; Mutant Reduction;
D O I
10.1145/3368089.3409742
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mutation testing research has often used the number of mutants as a surrogate measure for the true execution cost of generating and executing mutants. This poses a potential threat to the validity of the scientific findings reported in the literature. Out of 75 works surveyed in this paper, we found that 54 (72%) are vulnerable to this threat. To investigate the magnitude of the threat, we conducted an empirical evaluation using 10 real-world programs. The results reveal that: i) percentages of randomly sampled mutants differ from the true execution time, on average, by 44%, varying in difference from 19% to 91%; ii) errors arising from using the surrogate correlate with program size (rho = 0.74) and number of mutants (rho = 0.76), making the problem more pernicious for more realistic programs; iii) scientific findings concerning sampling strategies would have approximately 37% rank disagreement, indicating potentially dramatic impact on experiment validity. To investigate whether this threat matters in practice, we reproduced a seminal study on Selective Mutation (widely relied upon for more than two decades). The impact is stark: an inconclusive scientific finding using the surrogate is transformed to an unequivocal finding when using the true execution cost.
引用
收藏
页码:1127 / 1139
页数:13
相关论文
共 98 条
  • [31] Mutant reduction based on dominance relation for weak mutation testing
    Gong, Dunwei
    Zhang, Gongjie
    Yao, Xiangjuan
    Meng, Fanlin
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 81 : 82 - 96
  • [32] Mutation Reduction Strategies Considered Harmful
    Gopinath, Rahul
    Ahmed, Iftekhar
    Alipour, Mohammad Amin
    Jensen, Carlos
    Groce, Alex
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2017, 66 (03) : 854 - 874
  • [33] On The Limits of Mutation Reduction Strategies
    Gopinath, Rahul
    Alipour, Mohammad Amin
    Ahmed, Iftekhar
    Jensen, Carlos
    Groce, Alex
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 511 - 522
  • [34] Gopinath R, 2015, 2015 IEEE 26TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), P216, DOI 10.1109/ISSRE.2015.7381815
  • [35] Guizzo G., 2020, T SOFTWARE ENG
  • [36] Harman Mark, 2010, Proceedings of the IEEE Third International Conference on Software Testing Verification and Validation - Workshops (ICSTW 2010), P80, DOI 10.1109/ICSTW.2010.13
  • [37] Harman M., 2014, Proceedings of the 29th ACM/IEEE international conference on Automated software engineering, P397
  • [39] Reducing Mutants with Mutant Killable Precondition
    Iida, Chihiro
    Takada, Shingo
    [J]. 10TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS - ICSTW 2017, 2017, : 128 - 133
  • [40] Jaechang Nam, 2011, 2011 IEEE Fourth International Conference on Software Testing, Verification and Validation Workshops (ICSTW), P376, DOI 10.1109/ICSTW.2011.57