Adaptive and optimal online linear regression on l1-balls

被引:2
作者
Gerchinovitz, Sebastien [1 ]
Yu, Jia Yuan [2 ]
机构
[1] Ecole Normale Super, F-75005 Paris, France
[2] IBM Res Corp, Dublin 15, Ireland
关键词
Online learning; Linear regression; Adaptive algorithms; Minimax regret; GRADIENT; BOUNDS; RATES;
D O I
10.1016/j.tcs.2013.09.024
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of online linear regression on individual sequences. The goal in this paper is for the forecaster to output sequential predictions which are, after T time rounds, almost as good as the ones output by the best linear predictor in a given l(1)-ball in R-d. We consider both the cases where the dimension d is small and large relative to the time horizon T. We first present regret bounds with optimal dependencies on d. T, and on the sizes U, X and Y of the l(1)-ball, the input data and the observations. The minimax regret is shown to exhibit a regime transition around the point d = root TUX/(2Y). Furthermore, we present efficient algorithms that are adaptive, i.e., that do not require the knowledge of U, X, Y, and T, but still achieve nearly optimal regret bounds. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:4 / 28
页数:25
相关论文
共 22 条
[1]   Adaptive and self-confident on-line learning algorithms [J].
Auer, P ;
Cesa-Bianchi, N ;
Gentile, C .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) :48-75
[2]   Relative loss bounds for on-line density estimation with the exponential family of distributions [J].
Azoury, KS ;
Warmuth, MK .
MACHINE LEARNING, 2001, 43 (03) :211-246
[3]  
Birge L., 2001, J. Eur. Math. Soc, P203
[4]   Sequential procedures for aggregating arbitrary estimators of a conditional mean [J].
Bunea, Florentina ;
Nobel, Andrew .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (04) :1725-1735
[5]   Minimizing regret with label efficient prediction [J].
Cesa-Bianchi, N ;
Lugosi, G ;
Stoltz, G .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (06) :2152-2162
[6]   Analysis of two gradient-based algorithms for on-line regression [J].
Cesa-Bianchi, N .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 59 (03) :392-411
[7]  
Cesa-Bianchi N, 2006, Prediction, Learning, and Games
[8]   Improved second-order bounds for prediction with expert advice [J].
Cesa-Bianchi, Nicolo ;
Mansour, Yishay ;
Stoltz, Gilles .
MACHINE LEARNING, 2007, 66 (2-3) :321-352
[9]  
Duchi JC., 2010, COLT, P14
[10]   The robustness of the p-norm algorithms [J].
Gentile, C .
MACHINE LEARNING, 2003, 53 (03) :265-299