A Gaussian Latent Variable Model for Large Margin Classification of Labeled and Unlabeled Data

Cited by: 0
Authors
Kim, Do-kyum [1]
Der, Matthew [1]
Saul, Lawrence K. [1]
Affiliation
[1] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
Source
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33 | 2014 / Vol. 33
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We investigate a Gaussian latent variable model for semi-supervised learning of linear large margin classifiers. The model's latent variables encode the signed distance of examples to the separating hyperplane, and we constrain these variables, for both labeled and unlabeled examples, to ensure that the classes are separated by a large margin. Our approach is based on intuitions similar to those behind semi-supervised support vector machines (S3VMs), but these intuitions are formalized in a probabilistic framework. Within this framework we are able to derive an especially simple Expectation-Maximization (EM) algorithm for learning. The algorithm alternates between applying Bayes' rule to "fill in" the latent variables (the E-step) and performing an unconstrained least-squares regression to update the weight vector (the M-step). For the best results it is necessary to constrain the unlabeled data to have a similar ratio of positive to negative examples as the labeled data. Within our model this constraint renders exact inference intractable, but we show that a Lyapunov central limit theorem (for sums of independent, but non-identical random variables) provides an excellent approximation to the true posterior distribution. We perform experiments on large-scale text classification and find that our model significantly outperforms existing implementations of S3VMs.
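The alternation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions, because the abstract does not state them: each latent variable is taken to be z ~ N(w·x, 1) with unit variance, the margin constraint is taken to be y·z ≥ 1 for labeled examples and |z| ≥ 1 for unlabeled ones, and a tiny ridge term is added to the regression purely for numerical stability. The class-ratio constraint and its central-limit approximation are omitted. All function names (`posterior_mean`, `least_squares`, `fit_em`, `make_blobs`) are hypothetical.

```python
import math
import random

SQRT2 = math.sqrt(2.0)

def pdf(t):
    """Standard normal density."""
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)

def tail(t):
    """P(N(0,1) >= t); erfc keeps tail accuracy, the floor guards against underflow."""
    return max(0.5 * math.erfc(t / SQRT2), 1e-300)

def posterior_mean(mu, y):
    """E-step: mean of z ~ N(mu, 1) restricted by the (assumed) margin constraint.

    y = +1 or -1 (labeled): y * z >= 1.   y = 0 (unlabeled): |z| >= 1.
    """
    if y > 0:
        return mu + pdf(1.0 - mu) / tail(1.0 - mu)
    if y < 0:
        return mu - pdf(1.0 + mu) / tail(1.0 + mu)
    return mu + (pdf(1.0 - mu) - pdf(1.0 + mu)) / (tail(1.0 - mu) + tail(1.0 + mu))

def least_squares(X, t, ridge=1e-8):
    """M-step: solve (X^T X + ridge*I) w = X^T t by Gauss-Jordan elimination.

    The ridge term is an assumption added only to keep the solve well conditioned.
    """
    d = len(X[0])
    A = [[sum(x[i] * x[j] for x in X) + (ridge if i == j else 0.0) for j in range(d)]
         + [sum(x[i] * ti for x, ti in zip(X, t))] for i in range(d)]
    for c in range(d):
        p = max(range(c, d), key=lambda r: abs(A[r][c]))  # partial pivoting
        A[c], A[p] = A[p], A[c]
        for r in range(d):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][d] / A[i][i] for i in range(d)]

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def fit_em(X, y, iters=50):
    """Alternate the E-step (fill in E[z]) and the M-step (regress E[z] on x)."""
    w = [0.0] * len(X[0])
    for _ in range(iters):
        zbar = [posterior_mean(dot(w, x), yi) for x, yi in zip(X, y)]
        w = least_squares(X, zbar)
    return w

def make_blobs(n, rng):
    """Toy data: two Gaussian blobs at (+2, +2) and (-2, -2), plus a bias feature."""
    pts = []
    for i in range(n):
        s = 1 if i % 2 == 0 else -1
        pts.append(([rng.gauss(2.0 * s, 1.0), rng.gauss(2.0 * s, 1.0), 1.0], s))
    return pts

rng = random.Random(0)
labeled = make_blobs(10, rng)
unlabeled = make_blobs(200, rng)
X = [x for x, _ in labeled + unlabeled]
y = [s for _, s in labeled] + [0] * len(unlabeled)  # 0 marks "label unknown"

w = fit_em(X, y)
accuracy = sum((dot(w, x) > 0) == (s > 0) for x, s in unlabeled) / len(unlabeled)
print("weights:", w)
print("accuracy on the unlabeled pool:", accuracy)
```

In this sketch the true labels of the unlabeled pool are used only to score the result, never during fitting. The E-step reduces to the mean of a truncated Gaussian, which is why each iteration costs little more than one least-squares solve.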
Pages: 484-492
Page count: 9