Lower bounds on individual sequence regret

被引：3

作者：

Gofer, Eyal ^{[1
]}

Mansour, Yishay ^{[2
,3
]}

机构：

[1] Hebrew Univ Jerusalem, Rachel & Selim Benin Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel

[2] Microsoft Res, Herzliyya, Israel

[3] Tel Aviv Univ, Blavatnik Sch Comp Sci, IL-69978 Tel Aviv, Israel

来源：

MACHINE LEARNING | 2016年 / 103卷 / 01期

基金：

以色列科学基金会;

关键词：

Regret minimization; Online learning; Online linear optimization; Regret lower bounds; Regularized Follow the Leader;

D O I：

10.1007/s10994-015-5531-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work we lower bound the individual sequence anytime regret of a large family of online algorithms. This bound depends on the quadratic variation of the sequence, Q(T), and the learning rate. Nevertheless, we showthat any learning rate that guarantees a regret upper bound of O(root Q(T)) necessarily implies an Omega(root Q(T)) anytime regret on any sequence with quadratic variation Q(T). The algorithms we consider are online linear optimization forecasters whose weight vector at time t + 1 is the gradient of a concave potential function of cumulative losses at time t. We show that these algorithms include all linear Regularized Follow the Leader algorithms. We prove our result for the case of potentials with negative definite Hessians, and potentials for the best expert setting satisfying some natural regularity conditions. In the best expert setting, we give our result in terms of the translation-invariant relative quadratic variation. We apply our lower bounds to Randomized Weighted Majority and to linear cost Online Gradient Descent. We show that our analysis can be generalized to accommodate diverse measures of variation beside quadratic variation. We apply this generalized analysis to Online Gradient Descent with a regret upper bound that depends on the variance of losses.

引用

页码：1 / 26

页数：26

共 18 条

[1]

Boyd S, 2004, CONVEX OPTIMIZATION

[2]

Cesa-Bianchi N., 2006, PREDICTION LEARNING

[3] Improved second-order bounds for prediction with expert advice [J].

Cesa-Bianchi, Nicolo ;

Mansour, Yishay ;

Stoltz, Gilles .

MACHINE LEARNING, 2007, 66 (2-3) :321-352

[4]

Chiang C. K., 2012, J MACHINE LEARNING R, V23

[5]

DEMARZO P, 2006, P ACM S THEOR COMP S, P477

[6] Regret to the best vs. regret to the average [J].

Even-Dar, Eyal ;

Kearns, Michael ;

Mansour, Yishay ;

Wortman, Jennifer .

MACHINE LEARNING, 2008, 72 (1-2) :21-37

[7]

Gofer Eyal, 2012, Algorithmic Learning Theory. 23rd International Conference (ALT 2012). Proceedings, P275, DOI 10.1007/978-3-642-34106-9_23

[8]

Gofer E., 2014, THESIS TEL AVIV U

[9]

Gofer E., 2014, J MACHINE LEARNING R, V35, P210

[10]

Hazan E., 2006, THESIS PRINCETON U

← 1 2 →