A Practical Guide for Using Statistical Tests to Assess Randomized Algorithms in Software Engineering

被引:621
作者
Arcuri, Andrea [1 ]
Briand, Lionel [1 ]
机构
[1] Simula Res Lab, N-1325 Lysaker, Norway
来源
2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE) | 2011年
关键词
Statistical difference; effect size; parametric test; non-parametric test; confidence interval; Bonferroni adjustment; systematic review; survey; EFFECT SIZE; T-TEST; EVOLUTIONARY APPROACH; MANN-WHITNEY; BONFERRONI; POWER;
D O I
10.1145/1985793.1985795
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Randomized algorithms have been used to successfully address many different types of software engineering problems. This type of algorithms employ a degree of randomness as part of their logic. Randomized algorithms are useful for difficult problems where a precise solution cannot be derived in a deterministic way within reasonable time. However, randomized algorithms produce different results on every run when applied to the same problem instance. It is hence important to assess the effectiveness of randomized algorithms by collecting data from a large enough number of runs. The use of rigorous statistical tests is then essential to provide support to the conclusions derived by analyzing such data. In this paper, we provide a systematic review of the use of randomized algorithms in selected software engineering venues in 2009. Its goal is not to perform a complete survey but to get a representative snapshot of current practice in software engineering research. We show that randomized algorithms are used in a significant percentage of papers but that, in most cases, randomness is not properly accounted for. This casts doubts on the validity of most empirical results assessing randomized algorithms. There are numerous statistical tests, based on different assumptions, and it is not always clear when and how to use these tests. We hence provide practical guidelines to support empirical research on randomized algorithms in software engineering.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 61 条
  • [1] Mutation Operators for Spreadsheets
    Abraham, Robin
    Erwig, Martin
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2009, 35 (01) : 94 - 108
  • [2] An evolutionary approach to estimating software development projects
    Aguilar-Ruiz, JS
    Ramos, I
    Riquelme, JC
    Toro, M
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2001, 43 (14) : 875 - 882
  • [3] Ali S., 2009, IEEE T SOFTWARE ENG
  • [4] [Anonymous], 2010, Proceedings of the 19th International Symposium on Software Testing and Analysis, ISSTA'10
  • [5] Arcuri A, 2010, LECT NOTES COMPUT SC, V6435, P95, DOI 10.1007/978-3-642-16573-3_8
  • [6] Full Theoretical Runtime Analysis of Alternating Variable Method on the Triangle Classification Problem
    Arcuri, Andrea
    [J]. 1ST INTERNATIONAL SYMPOSIUM ON SEARCH BASED SOFTWARE ENGINEERING, PROCEEDINGS, 2009, : 113 - 121
  • [7] A Novel Co-evolutionary Approach to Automatic Software Bug Fixing
    Arcuri, Andrea
    Yao, Xin
    [J]. 2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 162 - 168
  • [8] The next release problem
    Bagnall, AJ
    Rayward-Smith, VJ
    Whittley, IM
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2001, 43 (14) : 883 - 890
  • [9] Canfora G, 2005, GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, P1069
  • [10] Cohen J., 1988, Statistical power analysis for the behavioral sciences, VSecond