Views on Internal and External Validity in Empirical Software Engineering

被引:135
作者
Siegmund, Janet [1 ]
Siegmund, Norbert [1 ]
Apel, Sven [1 ]
机构
[1] Univ Passau, Passau, Germany
来源
2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 1 | 2015年
关键词
D O I
10.1109/ICSE.2015.24
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Empirical methods have grown common in software engineering, but there is no consensus on how to apply them properly. Is practical relevance key? Do internally valid studies have any value? Should we replicate more to address the tradeoff between internal and external validity? We asked the community how empirical research should take place in software engineering, with a focus on the tradeoff between internal and external validity and replication, complemented with a literature review about the status of empirical research in software engineering. We found that the opinions differ considerably, and that there is no consensus in the community when to focus on internal or external validity and how to conduct and review replications.
引用
收藏
页码:9 / 19
页数:11
相关论文
共 36 条
[1]  
[Anonymous], 2002, EXPT QUASIEXPERIMENT
[2]  
Basili Victor, 1992, CSTR2956 U MAR COLL
[3]   EXPERIMENTATION IN SOFTWARE ENGINEERING [J].
BASILI, VR ;
SELBY, RW ;
HUTCHENS, DH .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1986, 12 (07) :733-743
[4]   Building knowledge through families of experiments [J].
Basili, VR ;
Lanubile, F .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (04) :456-473
[5]   Presenting software engineering results using structured abstracts: a randomised experiment [J].
Budgen, David ;
Kitchenham, Barbara A. ;
Charters, Stuart M. ;
Turner, Mark ;
Brereton, Pearl ;
Linkman, Stephen G. .
EMPIRICAL SOFTWARE ENGINEERING, 2008, 13 (04) :435-468
[6]   A systematic review of statistical power in software engineering experiments [J].
Dyba, Tore ;
Kampenes, Vigdis By ;
Sjoberg, Dag I. K. .
INFORMATION AND SOFTWARE TECHNOLOGY, 2006, 48 (08) :745-755
[7]  
Hanenberg S., 2010, P ACM INT C OBJ OR P, p[22, An experiment about static and dynamic type systems], DOI DOI 10.1145/1869459.1869462
[8]   Using students as subjects - a comparative study of students and professionals in lead-time impact assessment [J].
Host M. ;
Regnell B. ;
Wohlin C. .
Empirical Software Engineering, 2000, 5 (3) :201-214
[9]  
Hudson W., 2013, GUIDE ADV EMPIRICAL
[10]   A method for evaluating rigor and industrial relevance of technology evaluations [J].
Ivarsson, Martin ;
Gorschek, Tony .
EMPIRICAL SOFTWARE ENGINEERING, 2011, 16 (03) :365-395