Aspects of Semi-supervised and Active Learning in Conditional Random Fields

被引:0
作者
Sokolovska, Nataliya [1 ]
机构
[1] Univ Paris 11, LRI, CNRS, UMR 8623, F-91405 Orsay, France
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III | 2011年 / 6913卷
关键词
conditional random fields; probability of observations; active learning; semi-supervised learning; RECOGNITION; LIKELIHOOD;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conditional random fields are among the state-of-the art approaches to structured output prediction, and the model has been adopted for various real-world problems. The supervised classification is expensive, since it is usually expensive to produce labelled data. Unlabeled data are relatively cheap, but how to use it? Unlabeled data can be used to estimate marginal probability of observations, and we exploit this idea in our work. Introduction of unlabeled data and of probability of observations into a purely discriminative model is a challenging task. We consider an extrapolation of a recently proposed semi-supervised criterion to the model of conditional random fields, and show its drawbacks. We discuss alternative usage of the marginal probability and propose a pool-based active learning approach based on quota sampling. We carry out experiments on synthetic as well as on standard natural language data sets, and we show that the proposed quota sampling active learning method is efficient.
引用
收藏
页码:273 / 288
页数:16
相关论文
共 33 条
[1]  
[Anonymous], 2006, BOOK REV IEEE T NEUR
[2]  
[Anonymous], COMPLEX SYSTEMS
[3]  
[Anonymous], 2006, CVPR
[4]  
[Anonymous], 2006, INTRO STAT RELATIONA
[5]  
[Anonymous], TECHNICAL REPORT
[6]  
[Anonymous], NIPS
[7]  
[Anonymous], 2007, ICML
[8]  
[Anonymous], 2002, NIPS
[9]  
Bouchard G., 2004, IASC
[10]  
Brefeld U., 2006, ICML, P2