A reliable ensemble based approach to semi-supervised learning

被引:18
作者
de Vries, Sjoerd [1 ,2 ]
Thierens, Dirk [1 ]
机构
[1] Univ Utrecht, Princetonpl 5, NL-3584 CC Utrecht, Netherlands
[2] UMC Utrecht, Heidelberglaan 100, NL-3584 CX Utrecht, Netherlands
关键词
Ensemble learning; Out-of-bag error; Ranking; Self-training; Semi-supervised learning; Wrapper; CLASSIFICATION; REPOSITORY; SOFTWARE;
D O I
10.1016/j.knosys.2021.106738
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) methods attempt to achieve better classification of unseen data through the use of unlabeled data than can be achieved by learning from the available labeled data alone. Most SSL methods require the user to familiarize themselves with novel, complex concepts and to ensure the underlying assumptions made by these methods match the problem structure, or they risk a decrease in predictive performance. In this paper, we present the reliable semi-supervised ensemble learning (RESSEL) method, which exploits unlabeled data by using it to generate diverse classifiers through self-training and combines these classifiers into an ensemble for prediction. Our method functions as a wrapper around a supervised base classifier and refrains from introducing additional problem dependent assumptions. We conduct experiments on a number of commonly used data sets to prove its merit. The results show RESSEL improves significantly upon the supervised alternatives, provided that the base classifier which is used is able to produce adequate probability-based rankings. It is shown that RESSEL is reliable in that it delivers results comparable to supervised learning methods if this requirement is not met, while the method also broadens the range of good parameter values. Furthermore, RESSEL is demonstrated to outperform existing self-labeled wrapper approaches. (C) 2021 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:17
相关论文
共 71 条
[1]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[2]  
[Anonymous], 1999, ARTIFICIAL NEURAL NE
[3]  
[Anonymous], 2002, P ACM SIGKDD INT C K, DOI DOI 10.1145/775047.775090
[4]  
[Anonymous], 2005, SEMISUPERVISED LEARN
[5]  
[Anonymous], 2005, Information Fusion, DOI https://doi.org/10.1016/j.inffus.2004.04.009
[6]  
[Anonymous], 1995, P ACL
[7]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[8]  
Bennett KP, 1999, ADV NEUR IN, V11, P368
[9]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[10]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140