Improving semi-supervised learning through optimum connectivity

被引:43
作者
Amorim, Willian P. [1 ]
Falcao, Alexandre X. [2 ]
Papa, Joao P. [3 ]
Carvalho, Marcelo H. [1 ]
机构
[1] Fed Univ Mato Grosso UFMS, FACOM Inst Comp, Cidade Univ, BR-79070900 Campo Grande, MS, Brazil
[2] Univ Estadual Campinas, Inst Comp, Dept Informat Syst, Av Albert Einstein 1251, BR-13083852 Campinas, SP, Brazil
[3] Sao Paulo State Univ, Dept Comp, Av Eng Luiz Edmundo Carrijo Coube 14-01, BR-17033360 Bauru, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Semi-supervised learning; Optimum-path forest classifiers; MANIFOLD REGULARIZATION; CLASSIFICATION; MACHINE;
D O I
10.1016/j.patcog.2016.04.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The annotation of large data sets by a classifier is a problem whose challenge increases as the number of labeled samples used to train the classifier reduces in comparison to the number of unlabeled samples. In this context, semi-supervised learning methods aim at discovering and labeling informative samples among the unlabeled ones, such that their addition to the correct class in the training set can improve classification performance. We present a semi-supervised learning approach that connects unlabeled and labeled samples as nodes of a minimum-spanning tree and partitions the tree into an optimum-path forest rooted at the labeled nodes. It is suitable when most samples from a same class are more closely connected through sequences of nearby samples than samples from distinct classes, which is usually the case in data sets with a reasonable relation between number of samples and feature space dimension. The proposed solution is validated by using several data sets and state-of-the-art methods as baselines. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:72 / 85
页数:14
相关论文
共 44 条
[1]  
Alimoglu F., 1996, P 5 TURKISH ARTIFICI
[2]  
Amorim W. P., 2010, Proceedings of the 23rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI 2010), P353, DOI 10.1109/SIBGRAPI.2010.54
[3]   Semi-Supervised Pattern Classification Using Optimum-Path Forest [J].
Amorim, Willian P. ;
Falcao, Alexandre X. ;
Carvalho, Marcelo H. .
2014 27TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2014, :111-118
[4]  
[Anonymous], 2009, AI STAT
[5]  
[Anonymous], 2008, SEMISUPERVISED LEARN
[6]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[7]  
Basu S., 2002, P INT C MACH LEARN, P27
[8]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[9]  
Bickel Peter J., 2007, Lecture Notes-Monograph Series, P177, DOI DOI 10.1214/074921707000000148
[10]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962