Adaptive regularization of weight vectors

被引:181
作者
Crammer, Koby [1 ]
Kulesza, Alex [2 ]
Dredze, Mark [3 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[3] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21211 USA
关键词
Online learning; Supervised learning; Text classification; Adaptive regularization; PERCEPTRON; MARGIN; NOISE;
D O I
10.1007/s10994-013-5327-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present AROW, an online learning algorithm for binary and multiclass problems that combines large margin training, confidence weighting, and the capacity to handle non-separable data. AROW performs adaptive regularization of the prediction function upon seeing each new instance, allowing it to perform especially well in the presence of label noise. We derive mistake bounds for the binary and multiclass settings that are similar in form to the second order perceptron bound. Our bounds do not assume separability. We also relate our algorithm to recent confidence-weighted online learning techniques. Empirical evaluations show that AROW achieves state-of-the-art performance on a wide range of binary and multiclass tasks, as well as robustness in the face of non-separable data.
引用
收藏
页码:155 / 187
页数:33
相关论文
共 61 条
[1]  
[Anonymous], KDD 2006
[2]  
[Anonymous], 2010, COLT 10 P 23 ANN C L
[3]  
[Anonymous], NEURAL INFORM PROCES
[4]  
[Anonymous], 1997, Parallel Optimization: Theory, Algorithms, and Applications
[5]  
Bekkerman R., 2004, 4181 IR CTR INT INF
[6]   Global discriminative learning for higher-accuracy computational gene prediction [J].
Bernal, Axel ;
Crammer, Koby ;
Hatzigeorgiou, Artemis ;
Pereira, Fernando .
PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (03) :488-497
[7]  
Blitzer J, 2007, BIOGRAPHIES BOLLYWOO
[8]  
Bordes Antoine, 2005, LNAI
[9]  
Cesa-Bianchi N., 2005, SIAM J COMPUTING, P34
[10]   How to use expert advice [J].
CesaBianchi, N ;
Freund, Y ;
Haussler, D ;
Helmbold, DP ;
Schapire, RE ;
Warmuth, MK .
JOURNAL OF THE ACM, 1997, 44 (03) :427-485