The Peaking Phenomenon in Semi-supervised Learning

Cited by: 2
Authors
Krijthe, Jesse H. [1, 2]
Loog, Marco [1, 3]
Affiliations
[1] Delft Univ Technol, Pattern Recognit Lab, Delft, Netherlands
[2] Leiden Univ, Med Ctr, Dept Mol Epidemiol, Leiden, Netherlands
[3] Univ Copenhagen, Image Sect, Copenhagen, Denmark
Source
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016 | 2016 / Vol. 10029
Keywords
Semi-supervised learning; Peaking; Least squares classifier; Pseudo-inverse; CLASSIFIERS; ERROR
DOI
10.1007/978-3-319-49055-7_27
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
For the supervised least squares classifier, when the number of training objects is smaller than the dimensionality of the data, adding more data to the training set may first increase the error rate before decreasing it. This possibly counterintuitive phenomenon is known as peaking. In this work, we observe that a similar but more pronounced version of this phenomenon also occurs in the semi-supervised setting, where unlabeled rather than labeled objects are added to the training set. Through simulation studies and an approximation of the learning curve based on the work of Raudys and Duin, we explain why the learning curve in this setting has a steeper incline and a more gradual decline.
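The supervised peaking effect described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' experimental setup: the two-Gaussian data model, the dimensionality, the sample sizes, and the function name `simulate_peaking` are all assumptions chosen for illustration. It fits a least squares classifier through the pseudo-inverse and reports the mean test error for several training set sizes; the semi-supervised variant studied in the paper is not reproduced here.

```python
import numpy as np


def simulate_peaking(dim=50, sizes=(10, 25, 50, 100, 200), repeats=30,
                     delta=4.0, n_test=2000, seed=0):
    """Mean test error of a pseudo-inverse least squares classifier.

    Assumed data model (illustrative only): two spherical Gaussian
    classes with means +mu and -mu, where the distance between the
    class means equals `delta`.
    """
    rng = np.random.default_rng(seed)
    # Every coordinate carries the same signal; ||mu|| = delta / 2.
    mu = np.full(dim, delta / (2.0 * np.sqrt(dim)))

    def sample(n):
        # Labels are -1 or +1; each object is its class mean plus noise.
        y = rng.choice([-1.0, 1.0], size=n)
        X = y[:, None] * mu + rng.standard_normal((n, dim))
        return X, y

    mean_errors = {}
    for n in sizes:
        errs = []
        for _ in range(repeats):
            X, y = sample(n)
            # Least squares solution; for n < dim the pseudo-inverse
            # gives the minimum-norm weight vector.
            w = np.linalg.pinv(X) @ y
            X_te, y_te = sample(n_test)
            errs.append(np.mean(np.sign(X_te @ w) != y_te))
        mean_errors[n] = float(np.mean(errs))
    return mean_errors


if __name__ == "__main__":
    for n, err in simulate_peaking().items():
        print(f"n = {n:4d}  mean test error = {err:.3f}")
```

Under these assumptions, the error is expected to be highest when the number of training objects is close to the dimensionality (`n = 50` here) and to fall again as more labeled objects are added.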
Pages: 299-309
Page count: 11