A systematic review of effect size in software engineering experiments

被引:226
作者
Kampenes, Vigdis By
Dyba, Tore
Hannay, Jo E.
Sjoberg, Dag I. K.
机构
[1] Dept Software Engn, Simula Res Lab, NO-1325 Lysaker, Norway
[2] Univ Oslo, Dept Informat, NO-0316 Oslo, Norway
[3] SINTEF ICT, NO-7465 Trondheim, Norway
关键词
empirical software engineering; controlled experiments; effect size; statistical significance; practical importance;
D O I
10.1016/j.infsof.2007.02.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An effect size quantifies the effects of an experimental treatment. Conclusions drawn from hypothesis testing results might be erroneous if effect sizes are not judged in addition to statistical significance. This paper reports a systematic review of 92 controlled experiments published in 12 major software engineering journals and conference proceedings in the decade 1993-2002. The review investigates the practice of effect size reporting, summarizes standardized effect sizes detected in the experiments, discusses the results and gives advice for improvements. Standardized and/or unstandardized effect sizes were reported in 29% of the experiments. Interpretations of the effect sizes in terms of practical importance were not discussed beyond references to standard conventions. The standardized effect sizes computed from the reviewed experiments were equal to observations in psychology studies and slightly larger than standard conventions in behavioral science. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1073 / 1086
页数:14
相关论文
共 45 条
[41]   Statistical, practical, and "clinical": How many kinds of significance do counselors need to consider? [J].
Thompson, B .
JOURNAL OF COUNSELING AND DEVELOPMENT, 2002, 80 (01) :64-71
[42]   Practical guide for reporting effect size in quantitative research in the Journal of Counseling & Development [J].
Trusty, J ;
Thompson, B ;
Petrocelli, JV .
JOURNAL OF COUNSELING AND DEVELOPMENT, 2004, 82 (01) :107-110
[43]   Reporting practices and APA editorial policies regarding statistical significance and effect size [J].
Vacha-Haase, T ;
Nilsson, JE ;
Reetz, DR ;
Lance, TS ;
Thompson, B .
THEORY & PSYCHOLOGY, 2000, 10 (03) :413-425
[44]   How to estimate and interpret various effect sizes [J].
Vacha-Haase, T ;
Thompson, B .
JOURNAL OF COUNSELING PSYCHOLOGY, 2004, 51 (04) :473-481
[45]   Statistical methods in psychology journals -: Guidelines and explanations [J].
Wilkinson, L .
AMERICAN PSYCHOLOGIST, 1999, 54 (08) :594-604