THE PROBLEM OF UNDERESTIMATING THE RESIDUAL ERROR VARIANCE IN FORWARD STEPWISE REGRESSION

Cited by: 30
Authors
FREEDMAN, LS
PEE, D
MIDTHUNE, DN
Affiliations
[1] NCI,DIV CANC PREVENT & CONTROL,BIOMETRY BRANCH,BETHESDA,MD 20892
[2] INFORMAT MANAGEMENT SERV INC,ROCKVILLE,MD 20852
[3] INFORMAT MANAGEMENT SERV INC,SILVER SPRING,MD 20904
Source
STATISTICIAN | 1992 / Vol. 41 / No. 4
DOI
10.2307/2349005
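Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract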
Under the global null hypothesis that all covariates are unrelated to the outcome variables, forward stepwise regression procedures should have the property that the probability of selecting a given variable and finding it significant at the alpha level is equal to alpha. Because the residual error variance is underestimated, the actual probability can be very different from alpha. This problem becomes of practical concern when the ratio of the number of variables to the number of observations exceeds 0.25, and it is more serious for logistic than for linear regression.
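The underestimation described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the authors' simulation design: it assumes linear regression only, standard normal covariates and outcome (so the true error variance is 1), and a fixed number of forward steps `k` in place of a significance-based stopping rule. The function names and all parameter values are hypothetical choices made for illustration.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def center(v):
    m = sum(v) / len(v)
    return [a - m for a in v]

def residual_mean_square(n, p, k, rng, select=True):
    """Fit k of p pure-noise covariates to a pure-noise outcome.

    Returns RSS / (n - k - 1), the usual residual variance estimate.
    The true error variance is 1 by construction (global null).
    """
    resid = center([rng.gauss(0.0, 1.0) for _ in range(n)])
    cols = [center([rng.gauss(0.0, 1.0) for _ in range(n)]) for _ in range(p)]
    remaining = list(range(p))
    for _ in range(k):
        if select:
            # Forward step: take the covariate giving the largest drop in RSS.
            best = max(
                remaining,
                key=lambda j: dot(cols[j], resid) ** 2 / dot(cols[j], cols[j]),
            )
        else:
            # No selection: take covariates in a pre-specified order.
            best = remaining[0]
        remaining.remove(best)
        xb = cols[best]
        xb_ss = dot(xb, xb)
        coef = dot(xb, resid) / xb_ss
        resid = [r - coef * a for r, a in zip(resid, xb)]
        # Orthogonalize the remaining covariates against the one just entered,
        # so later steps measure the additional reduction in RSS.
        for j in remaining:
            g = dot(xb, cols[j]) / xb_ss
            cols[j] = [c - g * a for c, a in zip(cols[j], xb)]
    return dot(resid, resid) / (n - k - 1)

def mean_rms(n, p, k, reps, seed, select=True):
    """Average the residual variance estimate over `reps` simulated data sets."""
    rng = random.Random(seed)
    total = sum(residual_mean_square(n, p, k, rng, select) for _ in range(reps))
    return total / reps

# Ratio of variables to observations is 20 / 40 = 0.5, above the 0.25
# threshold mentioned in the abstract.
print("with forward selection:", mean_rms(40, 20, 5, 200, seed=1, select=True))
print("pre-specified model:   ", mean_rms(40, 20, 5, 200, seed=1, select=False))
```

Under these assumptions the pre-specified model should give an average close to the true variance of 1, while the forward-selected model should give a noticeably smaller average. A variance estimate that is too small inflates the test statistics of the selected variables, which is why the probability of declaring a variable significant can depart from alpha.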
Pages: 405-412
Page count: 8