Improving Supervised Learning with Multiple Clusterings

被引:0
作者
Wemmert, Cedric [1 ]
Forestier, Germain [1 ]
Derivaux, Sebastien [1 ]
机构
[1] Univ Strasbourg, LSIIT, Pole API, F-67412 Illkirch Graffenstaden, France
来源
APPLICATIONS OF SUPERVISED AND UNSUPERVISED ENSEMBLE METHODS | 2009年 / 245卷
关键词
semi-supervised learning; clustering; few labeled data; CLASSIFICATION; SPACES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification task involves inducing a predictive model using a set of labeled samples. The accuracy of the model usually increases as more labeled samples are available. When one has only few samples, the obtained model tends to offer poor results. Even when labeled samples are difficult to get, a lot of unlabeled samples are generally available on which unsupervised learning can be done. In this chapter, a way to combine supervised and unsupervised learning in order to use both labeled and unlabeled samples is explored. The efficiency of the method is evaluated on various UCI datasets and on the classification of a very high resolution remote sensing image when the number of labeled samples is very low.
引用
收藏
页码:135 / 149
页数:15
相关论文
共 20 条
[1]  
Bennett K.P., 2002, P 8 ACM SIGKDD INT C, P289
[2]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[3]  
CAI W, SIMULTANEO IN PRRESS
[4]   Learning from labeled and unlabeled data: An empirical study across techniques and domains [J].
Chawla, N ;
Karakoulas, G .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 23 :331-366
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]  
Deodhar M, 2007, KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P250
[7]  
Goldman S.A., 2000, P 17 INT C MACH LEAR, P327
[8]   Covariance matrix estimation and classification with limited training data [J].
Hoffbeck, JP ;
Landgrebe, DA .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (07) :763-767
[9]   ON MEAN ACCURACY OF STATISTICAL PATTERN RECOGNIZERS [J].
HUGHES, GF .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1968, 14 (01) :55-+
[10]   Cluster-space representation for hyperspectral data classification [J].
Jia, XP ;
Richards, JA .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2002, 40 (03) :593-598