Sharing experiments using open-source software

被引:10
作者
Nelson, Adam [1 ]
Menzies, Tim [1 ]
Gay, Gregory [1 ]
机构
[1] W Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
关键词
open source; data mining;
D O I
10.1002/spe.1004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
When researchers want to repeat, improve or refute prior conclusions, it is useful to have a complete and operational description of prior experiments. If those descriptions are overly long or complex, then sharing their details may not be informative. OURMINE is a scripting environment for the development and deployment of data mining experiments. Using OURMINE, data mining novices can specify and execute intricate experiments, while researchers can publish their complete experimental rig alongside their conclusions. This is achievable because of OURMINE's succinctness. For example, this paper presents two experiments documented in the OURMINE syntax. Thus, the brevity and simplicity of OURMINE recommends it as a better tool for documenting, executing, and sharing data mining experiments. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:283 / 305
页数:23
相关论文
共 21 条
[1]  
[Anonymous], THESIS W VIRGINIA U
[2]  
BASH RC, 1994, BOURNE AGAIN SHELL
[3]  
Chen ZH, 2005, IEEE SOFTWARE, V22, P38, DOI 10.1109/MS.2005.151
[4]  
EISENSTEIN J, 2004, VISUAL LINGUISTIC IN, P113
[5]  
Freund Y, 1999, MACHINE LEARNING, PROCEEDINGS, P124
[6]  
Gay Gregory., 2009, PROMISE 09, P1, DOI DOI 10.1145/1540438.1540460
[7]  
Gupta Chetan., 2004, SIAM INT C DATA MINI
[8]  
Kernighan BrianW., 1988, The AWK Programming Language
[9]  
Kitchenham BA, 2007, IEEE T SOFTWARE ENG, V33, P316, DOI 10.1109/TSE.2007.1101
[10]   Benchmarking classification models for software defect prediction: A proposed framework and novel findings [J].
Lessmann, Stefan ;
Baesens, Bart ;
Mues, Christophe ;
Pietsch, Swantje .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2008, 34 (04) :485-496