Multi-train: A semi-supervised heterogeneous ensemble classifier

被引:33
作者
Gu, Shenkai [1 ]
Jin, Yaochu [1 ,2 ]
机构
[1] Univ Surrey, Fac Engn & Phys Sci, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[2] Dalian Univ Technol, Sch Management Sci & Engn, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
Unlabeled data; Classification; Heterogeneous ensembles; Semi-supervised learning; Tri-training; Multi-train; NEURAL-NETWORK; ALGORITHM;
D O I
10.1016/j.neucom.2017.03.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world machine learning tasks have very limited labeled data but a large amount of unlabeled data. To take advantage of the unlabeled data for enhancing learning performance, several semi supervised learning techniques have been developed. In this paper, we propose a novel semi-supervised ensemble learning algorithm, termed Multi-Train, which generates a number of heterogeneous classifiers that use different classification models and/or different features. During the training process, each classifier is refined using unlabeled data, which are labeled by the majority prediction of the rest classifiers. We hypothesize that the use of different models and different input features can promote the diversity of the ensemble, thereby improving the performance compared to existing methods such as the co-training and tri-training algorithms. Experimental results on the UCI datasets clearly demonstrated the effectiveness of using heterogeneous ensembles in semi-supervised learning. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:202 / 211
页数:10
相关论文
共 34 条
[1]   INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW ;
KIBLER, D ;
ALBERT, MK .
MACHINE LEARNING, 1991, 6 (01) :37-66
[2]   Classifier ensembles for image identification using multi-objective Pareto features [J].
Albukhanajer, Wissam A. ;
Jin, Yaochu ;
Briffa, Johann A. .
NEUROCOMPUTING, 2017, 238 :316-327
[3]   Shape quantization and recognition with randomized trees [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1997, 9 (07) :1545-1588
[4]   An ensemble of dynamic neural network identifiers for fault detection and isolation of gas turbine engines [J].
Amozegar, M. ;
Khorasani, K. .
NEURAL NETWORKS, 2016, 76 :106-121
[5]  
[Anonymous], 2005, DATA MINING
[6]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[7]  
[Anonymous], 2000, P INT C MACH LEARN I
[8]  
[Anonymous], 2006, U B C
[9]   Semi-supervised learning on Riemannian manifolds [J].
Belkin, M ;
Niyogi, P .
MACHINE LEARNING, 2004, 56 (1-3) :209-239
[10]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962