Investigating the Performance of Alternate Regression Weights by Studying All Possible Criteria in Regression Models with a Fixed Set of Predictors

被引:0
作者
Niels Waller
Jeff Jones
机构
[1] University of Minnesota,Department of Psychology
来源
Psychometrika | 2011年 / 76卷
关键词
Monte Carlo; multiple regression; weighting;
D O I
暂无
中图分类号
学科分类号
摘要
We describe methods for assessing all possible criteria (i.e., dependent variables) and subsets of criteria for regression models with a fixed set of predictors, x (where x is an n×1 vector of independent variables). Our methods build upon the geometry of regression coefficients (hereafter called regression weights) in n-dimensional space. For a full-rank predictor correlation matrix, Rxx, of order n, and for regression models with constant R2 (coefficient of determination), the OLS weight vectors for all possible criteria terminate on the surface of an n-dimensional ellipsoid. The population performance of alternate regression weights—such as equal weights, correlation weights, or rounded weights—can be modeled as a function of the Cartesian coordinates of the ellipsoid. These geometrical notions can be easily extended to assess the sampling performance of alternate regression weights in models with either fixed or random predictors and for models with any value of R2. To illustrate these ideas, we describe algorithms and R (R Development Core Team, 2009) code for: (1) generating points that are uniformly distributed on the surface of an n-dimensional ellipsoid, (2) populating the set of regression (weight) vectors that define an elliptical arc in ℝn, and (3) populating the set of regression vectors that have constant cosine with a target vector in ℝn. Each algorithm is illustrated with real data. The examples demonstrate the usefulness of studying all possible criteria when evaluating alternate regression weights in regression models with a fixed set of predictors.
引用
收藏
相关论文
共 71 条
[1]  
Ackerman P.L.(1997)Intelligence, personality, and interests: Evidence for overlapping traits Psychological Bulletin 121 219-245
[2]  
Heggestad E.D.(1980)Multivariate analysis with latent variables: causal modeling Annual Review of Psychology 31 419-456
[3]  
Bentler P.(1988)A quirk in multiple regression: the whole regression can be greater than the sum of its parts The Statistician 37 371-374
[4]  
Bertrand P.V.(1994)On simulation of random vectors with given densities in regions and on their boundaries Journal of Applied Probability 31 205-220
[5]  
Holder R.L.(2007)Simulation studies of a phenomenological model for elongated virus capsid formation Physical Review E 75 1-7
[6]  
Borovkov K.(1972)A comparison of five variable weighting procedures Educational and Psychological Measurement 32 31-322
[7]  
Chen T.(1974)A revised definition for suppressor variables: a guide to their identification and interpretation Educational and Psychological Measurement 34 35-46
[8]  
Glotzer S.C.(1955)Construct validity in psychological tests Psychological Bulletin 52 281-302
[9]  
Claudy J.G.(2004)The superiority of simple alternatives to regression for social science predictions Journal of Educational and Behavioral Statistics 29 317-331
[10]  
Conger A.J.(2010)A constrained linear estimator for multiple regression Psychometrika 75 521-541