Would wider adoption of reproducible research be beneficial for empirical software engineering research?

Cited by: 28
Authors
Madeyski, Lech [1]
Kitchenham, Barbara [2]
Affiliations
[1] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Poland
[2] Keele Univ, Sch Comp & Math, Keele, Staffs, England
Keywords
Reproducible research; empirical software engineering; scientific practice;
DOI
10.3233/JIFS-169146
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Researchers have identified problems with the validity of software engineering research findings. In particular, it is often impossible to reproduce data analyses because raw data or sufficient summary statistics are missing, or because analysis procedures are undefined. The aim of this paper is to raise awareness of the problems caused by unreproducible research in software engineering and to discuss the concept of reproducible research (RR) as a mechanism to address these problems. RR is the idea that the outcome of research is both a paper and its computational environment. We report some recent studies that have cast doubt on the reliability of research outcomes in software engineering. We then discuss the use of RR in software engineering research as a means of addressing these problems and present the methodology we have used to adopt RR principles. We report a small working example of how to create reproducible research. We summarise the advantages of and problems with adopting RR methods. We conclude that RR supports good scientific practice and would help to address some of the problems found in empirical software engineering research.
Pages: 1509-1521
Number of pages: 13