Semi-supervised Learning Using Siamese Networks

被引:8
作者
Sahito, Attaullah [1 ]
Frank, Eibe [1 ]
Pfahringer, Bernhard [1 ]
机构
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
来源
AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年 / 11919卷
关键词
Semi-supervised learning; Siamese networks; Triplet loss; LLGC;
D O I
10.1007/978-3-030-35288-2_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have been successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples. These models, however, are more difficult to train successfully for semi-supervised problems where small amounts of labeled instances are available along with a large number of unlabeled instances. This work explores a new training method for semi-supervised learning that is based on similarity function learning using a Siamese network to obtain a suitable embedding. The learned representations are discriminative in Euclidean space, and hence can be used for labeling unlabeled instances using a nearest-neighbor classifier. Confident predictions of unlabeled instances are used as true labels for retraining the Siamese network on the expanded training set. This process is applied iteratively. We perform an empirical study of this iterative self-training algorithm. For improving unlabeled predictions, local learning with global consistency [22] is also evaluated.
引用
收藏
页码:586 / 597
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 2006, Semi-Supervised Learning, DOI DOI 10.7551/MITPRESS/9780262033589.003.0003
[2]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[3]  
Brefeld U, 2004, P 21 INT C MACH LEAR, P16
[4]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
[5]  
Chapelle O., 2009, Semi-Supervised Learning, V20, P542, DOI 10.1109/TNN.2009.2015974
[6]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[7]  
Dai Z., 2017, P ADV NEUR INF PROC, P6510
[8]  
Hoffer E, 2018, Arxiv, DOI arXiv:1611.01449
[9]  
Kingma DP, 2014, ADV NEUR IN, V27
[10]  
Laine S, 2017, Arxiv, DOI arXiv:1610.02242