Safety-aware Graph-based Semi-Supervised Learning

被引:30
作者
Gan, Haitao [1 ,2 ]
Li, Zhenhua [1 ]
Wu, Wei [1 ]
Luo, Zhizeng [1 ]
Huang, Rui [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Zhejiang, Peoples R China
[2] MOE Key Lab Image Proc & Intelligence Control, Wuhan, Hubei, Peoples R China
[3] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Guangdong, Peoples R China
关键词
Semi-supervised learning; Graph composite; Safety mechanism; Laplacian support vector machine; CLASSIFICATION;
D O I
10.1016/j.eswa.2018.04.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning field, Graph-based Semi-Supervised Learning (GSSL) has recently attracted much attention and many researchers have proposed a number of different methods. GSSL generally constructs a k nearest neighbors graph to explore manifold structure which may improve learning performance of GSSL. If one uses an inappropriate graph to learn a semi-supervised classifier, the performance of the classifier may be worse than that of supervised learning (SL) only trained by labeled samples. Hence, it is worthy to design a safe version to broaden the application area of GSSL. In this paper, we introduce a Safety-aware GSSL (SaGSSL) method which can adaptively select the good graphs and learn a safe semi-supervised classifier simultaneously. The basic assumption is that a graph has a high quality if the sample margin obtained by GSSL with the graph is larger than that obtained by SL. By identifying the high-quality graphs and setting the corresponding weights large, the predictions of our algorithm will approach to those of GSSL with the graphs. Meanwhile, the weights of the low-quality graphs should be small and the predictions of our algorithm will be close to those of SL. Hence the degeneration probability will be reduced and our algorithm is expected to realize the goal of safe exploitation of different graphs. Experimental results on several datasets show that our algorithm can simultaneously implement the graph selection and safely exploit the unlabeled samples. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 45 条
[1]  
Anis A., 2017, SAMPLING THEORY PERS
[2]  
[Anonymous], 2017, CORR
[3]  
[Anonymous], 2006, BOOK REV IEEE T NEUR
[4]  
[Anonymous], 2001, P 18 INT C MACH LEAR
[5]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[6]  
[Anonymous], 2009, Advances in neural information processing systems
[7]   Fuzziness based semi-supervised learning approach for intrusion detection system [J].
Ashfaq, Rana Aamir Raza ;
Wang, Xi-Zhao ;
Huang, Joshua Zhexue ;
Abbas, Haider ;
He, Yu-Lin .
INFORMATION SCIENCES, 2017, 378 :484-497
[8]   Semi-supervised clustering methods [J].
Bair, Eric .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05) :349-361
[9]  
Basu S., 2002, P INT C MACH LEARN, P27
[10]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399