Collinearity diagnostics of binary logistic regression model

被引:747
作者
Midi, Habshah [1 ]
Sarkar, S. K. [1 ]
Rana, Sohel [1 ]
机构
[1] Univ Putra Malaysia, Inst Math Res, Lab Appl & Computat Stat, Serdang 43400, Selangor, Malaysia
关键词
Collinearity; tolerance; variance inflation factor; condition index; eigen value; variance proportion;
D O I
10.1080/09720502.2010.10700699
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Multicollinearity is a statistical phenomenon in which predictor variables in a logistic regression model are highly correlated. It is not uncommon when there are a large number of covariates in the model. Multicollinearity has been the thousand pounds monster in statistical modeling. Taming this monster has proven to be one of the great challenges of statistical modeling research. Multicollinearity can cause unstable estimates and inaccurate variances which affects confidence intervals and hypothesis tests. The existence of collinearity inflates the variances of the parameter estimates, and consequently incorrect inferences about relationships between explanatory and response variables. Examining the correlation matrix may be helpful to detect multicollinearity but not sufficient. Much better diagnostics are produced by linear regression with the option tolerance, Vif, condition indices and variance proportions. For moderate to large sample sizes, the approach to drop one of the correlated variables was established entirely satisfactory to reduce multicollinearity. On the light of different collinearity diagnostics, we may safely conclude that without increasing sample size, the second choice to omit one of the correlated variables can reduce multicollinearity to a great extent.
引用
收藏
页码:253 / 267
页数:15
相关论文
共 12 条
[1]  
Allison, 2001, LOGISTIC REGRESSION
[2]   Comparing logit and probit coefficients across groups [J].
Allison, PD .
SOCIOLOGICAL METHODS & RESEARCH, 1999, 28 (02) :186-208
[3]  
Belsley DA., 1980, REGRESSION DIAGNOSTI
[4]  
Field A., 2009, DESCOBRINDO ESTATIST
[5]  
Hosmer DW., 2000, APPL LOGISTIC REGRES, DOI DOI 10.1002/0471722146
[6]  
Kutner M., 2005, APPL LINEAR STAT MOD, VFifth edit.
[7]  
Mayers R., 1990, CLASSICAL MODERN REG
[8]  
Menard S., 2002, APPL LOGISTIC REGRES, V2nd
[9]  
O'Connell, 2005, QUANTITATIVE APPL SO, V146
[10]  
Pampel F. C., 2000, LOGISTIC REGRESSION