Logistic regression when binary predictor variables are highly correlated

被引:30
作者
Barker, L [1 ]
Brown, C [1 ]
机构
[1] Ctr Dis Control & Prevent, Natl Immunizat Program, Atlanta, GA 30033 USA
关键词
D O I
10.1002/sim.680
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Standard logistic regression can produce estimates having large mean square error when predictor variables are multicollinear. Ridge regression and principal components regression can reduce the impact of multicollinearity in ordinary least squares regression. Generalizations of these, applicable in the logistic regression framework, are alternatives to standard logistic regression. It is shown that estimates obtained via ridge and principal components logistic regression can have smaller mean square error than estimates obtained through standard logistic regression. Recommendations for choosing among standard, ridge and principal components logistic regression are developed. Published in 2001 by John Wiley & Sons, Ltd.
引用
收藏
页码:1431 / 1442
页数:12
相关论文
共 14 条
[1]  
[Anonymous], [No title captured]
[2]  
CLARK L, 1990, STAT MODELS S
[3]  
Efron B., 1993, INTRO BOOTSTRAP, V1st ed., DOI DOI 10.1201/9780429246593
[4]  
Hansen MorrisH., 1953, Sample survey methods and theory, VI.
[5]   RIDGE REGRESSION - BIASED ESTIMATION FOR NONORTHOGONAL PROBLEMS [J].
HOERL, AE ;
KENNARD, RW .
TECHNOMETRICS, 1970, 12 (01) :55-&
[6]   THE VARIMAX CRITERION FOR ANALYTIC ROTATION IN FACTOR-ANALYSIS [J].
KAISER, HF .
PSYCHOMETRIKA, 1958, 23 (03) :187-200
[7]   SIMULATION STUDY OF RIDGE AND OTHER REGRESSION ESTIMATORS [J].
LAWLESS, JF ;
WANG, P .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1976, A 5 (04) :307-323
[8]  
McCullagh P, 1992, GEN LINEAR MODELS
[9]   A simulation study of the number of events per variable in logistic regression analysis [J].
Peduzzi, P ;
Concato, J ;
Kemper, E ;
Holford, TR ;
Feinstein, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1996, 49 (12) :1373-1379
[10]  
RASP J, 1988, AM STAT ASS P BUS EC, P152