A systematic review of effect size in software engineering experiments

被引:225
作者
Kampenes, Vigdis By
Dyba, Tore
Hannay, Jo E.
Sjoberg, Dag I. K.
机构
[1] Dept Software Engn, Simula Res Lab, NO-1325 Lysaker, Norway
[2] Univ Oslo, Dept Informat, NO-0316 Oslo, Norway
[3] SINTEF ICT, NO-7465 Trondheim, Norway
关键词
empirical software engineering; controlled experiments; effect size; statistical significance; practical importance;
D O I
10.1016/j.infsof.2007.02.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An effect size quantifies the effects of an experimental treatment. Conclusions drawn from hypothesis testing results might be erroneous if effect sizes are not judged in addition to statistical significance. This paper reports a systematic review of 92 controlled experiments published in 12 major software engineering journals and conference proceedings in the decade 1993-2002. The review investigates the practice of effect size reporting, summarizes standardized effect sizes detected in the experiments, discusses the results and gives advice for improvements. Standardized and/or unstandardized effect sizes were reported in 29% of the experiments. Interpretations of the effect sizes in terms of practical importance were not discussed beyond references to standard conventions. The standardized effect sizes computed from the reviewed experiments were equal to observations in psychology studies and slightly larger than standard conventions in behavioral science. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1073 / 1086
页数:14
相关论文
共 45 条
  • [1] The revised CONSORT statement for reporting randomized trials: Explanation and elaboration
    Altman, DG
    Schulz, KF
    Moher, D
    Egger, M
    Davidoff, F
    Elbourne, D
    Gotzsche, PC
    Lang, T
    [J]. ANNALS OF INTERNAL MEDICINE, 2001, 134 (08) : 663 - 694
  • [2] [Anonymous], ES COMPUTER PROGRAM
  • [3] [Anonymous], 2005, EFFECT SIZE RES BROA
  • [4] Effect size estimation: Factors to consider and mistakes to avoid
    Breaugh, JA
    [J]. JOURNAL OF MANAGEMENT, 2003, 29 (01) : 79 - 97
  • [5] COCHRAN W. G., 1937, J. Roy. Statist. Soc. 1937., (Suppl.), V4, P102
  • [6] THINGS I HAVE LEARNED (SO FAR)
    COHEN, J
    [J]. AMERICAN PSYCHOLOGIST, 1990, 45 (12) : 1304 - 1312
  • [7] COHEN J, 1994, AM PSYCHOL, V49, P997, DOI 10.1037/0003-066X.50.12.1103
  • [8] A POWER PRIMER
    COHEN, J
    [J]. PSYCHOLOGICAL BULLETIN, 1992, 112 (01) : 155 - 159
  • [9] COHEN J, 1965, HDB CLIN PSYCH
  • [10] Cohen J., 1988, POWERSTATISTICALSCIE, DOI 10.4324/9780203771587