Sequential procedures for aggregating arbitrary estimators of a conditional mean

被引:18
作者
Bunea, Florentina [1 ]
Nobel, Andrew [2 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
aggregation; Bayesian model averaging; individual sequence; oracle inequality; prediction; regression;
D O I
10.1109/TIT.2008.917657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this correspondence, a sequential procedure for aggregating linear combinations of a finite family of regression estimates is described and analyzed. Particular attention is given to linear combinations having coefficients in the generalized simplex. The procedure is based on exponential weighting, and has a computationally tractable approximation. Analysis of the procedure is based in part on techniques from the sequential prediction of nonrandom sequences. Here these techniques are applied in a stochastic setting to obtain cumulative loss bounds for the aggregation procedure. From the cumulative loss bounds we derive an oracle inequality for the aggregate estimator for an unbounded response having a suitable moment-generating function. The inequality shows that the risk of the aggregate estimator is less than the risk of the best candidate linear combination in the generalized simplex, plus a complexity term that depends on the size of the coefficient set. The inequality readily yields convergence rates for aggregation over the unit simplex that are within logarithmic factors of known minimax bounds. Some preliminary results on model selection are also presented.
引用
收藏
页码:1725 / 1735
页数:11
相关论文
共 37 条
[1]  
[Anonymous], LECT NOTES MATH
[2]  
AUDIBERT JY, 2003, PROBAB STAT, V40, P685
[3]   Relative loss bounds for on-line density estimation with the exponential family of distributions [J].
Azoury, KS ;
Warmuth, MK .
MACHINE LEARNING, 2001, 43 (03) :211-246
[4]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[5]   Model selection via testing:: an alternative to (penalized) maximum likelihood estimators [J].
Birgé, L .
ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2006, 42 (03) :273-325
[6]   Aggregation for gaussian regression [J].
Bunea, Florentina ;
Tsybakov, Alexandre B. ;
Wegkamp, Marten H. .
ANNALS OF STATISTICS, 2007, 35 (04) :1674-1697
[7]  
Cesa-Bianchi N, 1999, ANN STAT, V27, P1865
[8]  
Cesa-Bianchi N, 2006, PREDICTION LEARNING
[9]   On-line prediction and conversion strategies [J].
CesaBianchi, N ;
Freund, Y ;
Helmbold, DP ;
Warmuth, MK .
MACHINE LEARNING, 1996, 25 (01) :71-110
[10]   How to use expert advice [J].
CesaBianchi, N ;
Freund, Y ;
Haussler, D ;
Helmbold, DP ;
Schapire, RE ;
Warmuth, MK .
JOURNAL OF THE ACM, 1997, 44 (03) :427-485