Stepwise regression in mixed quantitative linear models with autocorrelated errors

被引:12
作者
Alpargu, G
Dutilleul, P [1 ]
机构
[1] McGill Univ, Dept Plant Sci, Montreal, PQ, Canada
[2] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
基金
加拿大自然科学与工程研究理事会;
关键词
effective sample size; First differences; First-difference ratios; fixed vs. random explanatory variables; Least squares; Maximum likelihood; modified t-test; restricted maximum likelihood;
D O I
10.1080/03610910500416082
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the stepwise procedure of selection of a fixed or a random explanatory variable in a mixed quantitative linear model with errors following a Gaussian stationary autocorrelated process, we have studied the efficiency of five estimators relative to Generalized Least Squares (GLS): Ordinary Least Squares (OLS), Maximum Likelihood (ML), Restricted Maximum Likelihood (REML), First Differences (FD), and First-Difference Ratios (FDR). We have also studied the validity and power of seven derived testing procedures, to assess the significance of the slope of the candidate explanatory variable x 2 to enter the model in which there is already one regressor x 1 . In addition to five testing procedures of the literature, we considered the FDR t -test with n - 3 df and the modified t -test with n - 3 df for partial correlations, where n is Dutilleul's effective sample size. Efficiency, validity, and power were analyzed by Monte Carlo simulations, as functions of the nature, fixed vs. random (purely random or autocorrelated), of x 1 and x 2 , the sample size and the autocorrelation of random terms in the regression model. We report extensive results for the autocorrelation structure of first-order autoregressive [AR(1)] type, and discuss results we obtained for other autocorrelation structures, such as spherical semivariogram, first-order moving average [MA(1)] and ARMA(1,1), but we could not present because of space constraints. Overall, we found that: the efficiency of slope estimators and the validity of testing procedures depend primarily on the nature of x 2 , but not on that of x 1 ;FDR is the most inefficient slope estimator, regardless of the nature of x 1 and x 2 ;REML is the most efficient of the slope estimators compared relative to GLS, provided the specified autocorrelation structure is correct and the sample size is large enough to ensure the convergence of its optimization algorithm;the FDR t -test, the modified t -test and the REML t -test are the most valid of the testing procedures compared, despite the inefficiency of the FDR and OLS slope estimators for the former two;the FDR t -test, however, suffers from a lack of power that varies with the nature of x 1 and x 2 ; andthe modified t -test for partial correlations, which does not require the specification of an autocorrelation structure, can be recommended when x 1 is fixed or random and x 2 is random, whether purely random or autocorrelated. Our results are illustrated by the environmental data that motivated our work.
引用
收藏
页码:79 / 104
页数:26
相关论文
共 28 条
[1]   To be or not to be valid in testing the significance of the slope in simple quantitative linear models with autocorrelated errors [J].
Alpargu, G ;
Dutilleul, P .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2003, 73 (03) :165-180
[2]   Efficiency and validity analyses of two-stage estimation procedures and derived testing procedures in quantitative linear models with AR(1) errors [J].
Alpargu, G ;
Dutilleul, P .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2003, 32 (03) :799-833
[3]   Efficiency analysis of ten estimation procedures for quantitative linear models with autocorrelated errors [J].
Alpargu, G ;
Dutilleul, P .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2001, 69 (03) :257-275
[4]   MAXIMUM LIKELIHOOD PROCEDURE FOR REGRESSION WITH AUTO-CORRELATED ERRORS [J].
BEACH, CM ;
MACKINNON, JG .
ECONOMETRICA, 1978, 46 (01) :51-58
[5]  
Box G.E. P., 1994, Time Series Analysis: Forecasting Control, V3rd
[6]   ASSESSING THE SIGNIFICANCE OF THE CORRELATION BETWEEN 2 SPATIAL PROCESSES [J].
CLIFFORD, P ;
RICHARDSON, S ;
HEMON, D .
BIOMETRICS, 1989, 45 (01) :123-134
[7]  
Diggle PJ., 1996, Analysis of longitudinal data
[8]   MODIFYING THE T-TEST FOR ASSESSING THE CORRELATION BETWEEN 2 SPATIAL PROCESSES [J].
DUTILLEUL, P .
BIOMETRICS, 1993, 49 (01) :305-314
[9]  
Dutilleul P., 1992, J Stat Comput Simul, V42, P79, DOI DOI 10.1080/00949659208811412
[10]  
FISHER RA, 1950, STAT METHODS EXPT DE