Toward a Common Framework for Statistical Analysis and Development

被引:217
作者
Imai, Kosuke [1 ]
King, Gary [2 ]
Lau, Olivia [3 ]
机构
[1] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
[2] Harvard Univ, Inst Quantitat Social Sci, Cambridge, MA 02138 USA
[3] Harvard Univ, Dept Govt, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
Graphical user interface; Interdisciplinary; R language; Statistical ontology; Statistical software;
D O I
10.1198/106186008X384898
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We develop a general ontology of statistical methods and use it to propose a common framework for statistical analysis and software development built on and within the R language, including R's numerous existing packages. This framework offers a simple unified structure and syntax that can encompass a large fraction of existing statistical procedures. We conjecture that it can be used to encompass and present simply a vast majority of existing statistical methods, without requiring changes in existing approaches, and regardless of the theory of inference on which they are based, notation with which they were developed, and programming syntax with which they have been implemented. This development enabled us, and should enable others, to design statistical software with a single, simple, and unified user interface that helps overcome the conflicting notation, syntax, jargon, and statistical methods existing across the methods subfields of numerous academic disciplines. The approach also enables one to build a graphical user interface that automatically includes any method encompassed within the framework. We hope that the result of this line of research will greatly reduce the time from the creation of a new statistical innovation to its widespread use by applied researchers whether or not they use or program in R.
引用
收藏
页码:892 / 913
页数:22
相关论文
共 21 条
[1]  
[Anonymous], 1988, The new S language
[2]  
Bishop CM., 1995, Neural networks for pattern recognition
[3]   Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference [J].
Ho, Daniel E. ;
Imai, Kosuke ;
King, Gary ;
Stuart, Elizabeth A. .
POLITICAL ANALYSIS, 2007, 15 (03) :199-236
[4]  
HO DE, J STAT SOFT IN PRESS
[5]  
Honaker J., 2007, WHAT MISSING VALUES
[6]  
HONAKER J, 2006, AMELIA, V2
[7]  
Ihaka R., 1996, Journal of computational and graphical statistics, V5, P299, DOI [10.1080/10618600.1996.10474713, 10.2307/1390807]
[8]  
Imai Kosuke., 2006, Zelig: Everyone's Statistical Software
[9]   Nonparametric estimation of average treatment effects under exogeneity: A review [J].
Imbens, GW .
REVIEW OF ECONOMICS AND STATISTICS, 2004, 86 (01) :4-29
[10]   The dangers of extreme counterfactuals [J].
King, G ;
Zeng, LC .
POLITICAL ANALYSIS, 2006, 14 (02) :131-159