An active learning method based on three-way decision model

被引:0
作者
Hu F. [1 ,2 ]
Zhang M. [1 ,2 ]
Yu H. [1 ,2 ]
机构
[1] School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing
[2] Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing
来源
Kongzhi yu Juece/Control and Decision | 2019年 / 34卷 / 04期
关键词
Active learning; Decision function; Machine learning; Three-way decision; Uncertainty; Unlabeled samples;
D O I
10.13195/j.kzyjc.2017.1342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning is one of the focuses in the field of machine learning, aiming to solve the unlabeled problem of samples. In this paper, a three-way decision model is applied to active learning. By introducing decision functions, the unlabeled samples are divided into three different parts: positive region, boundary region and negative region based on the uncertainty of unlabeled samples. Different solutions are adopted to process samples for each region. Then, an active learning method based on the three-way decision model, namely TWD_ctive, is developed. The most useful samples are selected using the active learning method, and are labeled by experts, so more effective models can be trained by the expanded training set. Compared with traditional passive learning, this method can choose the informational and representative samples to label, avoiding the redundant addition of sample. The models are continuously trained until the expected number of iterations or performance indicators are achieved. Experimental results show that the proposed algorithm has a better performance in measures F-value, AUC and the effectiveness of the algorithm is verified. © 2019, Editorial Office of Control and Decision. All right reserved.
引用
收藏
页码:718 / 726
页数:8
相关论文
共 32 条
[1]  
Gong X.J., Sun J.P., Shi Z.Z., Active bayesian network classifier, Computer Research and Development, 39, 5, pp. 574-579, (2002)
[2]  
Culotta A., McCallum A., Reducing labeling effort for structured prediction tasks, Proc of AAAI 2005, pp. 746-751, (2005)
[3]  
Lewis D., Catlett J., Heterogeneous uncertainty sampling for supervised learning, Proc of ICML 1994, pp. 148-156, (1994)
[4]  
Scheffer T., Decomain C., Wrobel S., Active hidden Markov models for information extraction, Int Conf on Advances in Intelligent Data Analysis, pp. 309-318, (2001)
[5]  
Settles B., Active learning literature survey, University of Wisconsinmadison, 39, 2, pp. 127-131, (2009)
[6]  
Tong S., Koller D., Support vector machine active learning with applications to text classification, J of Machine Learning Research, 2, 1, pp. 45-66, (2001)
[7]  
Zhang C., Chen T., An active learni ng framework for content based information retrieval, IEEE Trans on Multimedia, 4, 2, pp. 260-268, (2002)
[8]  
Xu Z., Yu K., Tresp V., Et al., Representative sampling for text classification using support vector machines, European Conf on IR Research, pp. 393-407, (2003)
[9]  
Donmez P., Carbonell J.G., Bennett P.N., Dual strategy active learning, Proc of the 18th European Conf on Machine Learning, pp. 208-215, (2008)
[10]  
Hoi S.C.H., Jin R., Zhu J., Et al., Semi-supervised svm batch mode active learning for image retrieval, Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recongnition, pp. 1-7, (2008)