Model averaging: A shrinkage perspective

Times cited: 0
Author
Peng, Jingfu [1]
Affiliation
[1] Tsinghua Univ, Yau Math Sci Ctr, Beijing, Peoples R China
Source
ELECTRONIC JOURNAL OF STATISTICS | 2024, Vol. 18, No. 2
Keywords
Model averaging; secondary Stein shrinkage; penalized blockwise Stein rule; asymptotic optimality; REGRESSION; AGGREGATION; ESTIMATORS; RULE; INFERENCE;
DOI
10.1214/24-EJS2282
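Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract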
Model averaging (MA), a technique for combining estimators from a set of candidate models, has attracted increasing attention in machine learning and statistics. In the existing literature, there is an implicit understanding that MA can be viewed as a form of shrinkage estimation that draws the response vector towards the subspaces spanned by the candidate models. This paper explores this perspective by establishing connections between MA and shrinkage in a linear regression setting with multiple nested models. We first demonstrate that the optimal MA estimator is the best linear estimator with monotonically non-increasing weights in a Gaussian sequence model. The Mallows MA (MMA), which estimates weights by minimizing Mallows' C_p over the unit simplex, can be viewed as a variation of the sum of a set of positive-part Stein estimators. Indeed, the latter estimator differs from the MMA only in that its optimization of Mallows' C_p is within a suitably relaxed weight set. Motivated by these connections, we develop a novel MA procedure based on blockwise Stein estimation. The resulting Stein-type MA estimator is asymptotically optimal across a broad parameter space when the variance is known. Numerical results support our theoretical findings. The connections established in this paper may open up new avenues for investigating MA from different perspectives. A discussion on some topics for future research concludes the paper.
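For readers unfamiliar with the building block named in the abstract, the following is a minimal sketch of a classical positive-part James-Stein estimator in a Gaussian sequence model with known variance. It is not the paper's blockwise Stein-type MA procedure; the function name `positive_part_james_stein`, the `(n - 2)` shrinkage constant, and the toy inputs are illustrative assumptions only.

```python
from typing import List


def positive_part_james_stein(y: List[float], sigma2: float) -> List[float]:
    """Shrink y toward the origin with the classical positive-part James-Stein rule.

    Assumes y_i = theta_i + noise with known noise variance sigma2 (illustrative setup).
    """
    n = len(y)
    if n < 3:
        raise ValueError("the James-Stein rule needs at least 3 coordinates")
    norm2 = sum(v * v for v in y)
    if norm2 == 0.0:
        # Nothing to shrink: the observation is already at the origin.
        return [0.0] * n
    # Shrinkage factor, truncated at zero (the "positive part").
    factor = max(0.0, 1.0 - (n - 2) * sigma2 / norm2)
    return [factor * v for v in y]


# A strong signal is shrunk only slightly; a weak one is set to zero.
strong = positive_part_james_stein([3.0, 4.0, 0.0, 0.0], sigma2=1.0)
weak = positive_part_james_stein([0.1, 0.1, 0.1, 0.1], sigma2=1.0)
```

The truncation at zero is what makes the rule "positive-part": when the observed norm is small relative to the noise level, the estimate collapses to the shrinkage target instead of flipping sign.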
Pages: 3535-3572
Page count: 38