Semi-Supervised Classification with Hybrid Generative/Discriminative Methods

Cited by: 0
Authors
Druck, Gregory [1]
Pal, Chris [1]
Zhu, Xiaojin [2]
McCallum, Andrew [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
[2] Univ Wisconsin, Madison, WI 53706 USA
Source
KDD-2007: Proceedings of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining | 2007
Funding
U.S. National Science Foundation
Keywords
Semi-supervised learning; hybrid generative/discriminative methods; text classification;
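DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract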
We compare two recently proposed frameworks for combining generative and discriminative probabilistic classifiers and apply them to semi-supervised classification. In both cases we explore the tradeoff between maximizing a discriminative likelihood of labeled data and a generative likelihood of labeled and unlabeled data. While prominent semi-supervised learning methods assume low-density regions between classes or are subject to generative modeling assumptions, we conjecture that hybrid generative/discriminative methods allow semi-supervised learning in the presence of strongly overlapping classes and reduce the risk of modeling structure in the unlabeled data that is irrelevant for the specific classification task of interest. We apply both hybrid approaches within naively structured Markov random field models and provide a thorough empirical comparison with two well-known semi-supervised learning methods on six text classification tasks. A semi-supervised hybrid generative/discriminative method provides the best accuracy in 75% of the experiments, and the multi-conditional learning hybrid approach achieves the highest overall mean accuracy across all tasks.
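As a rough illustration of the tradeoff described in the abstract, one generic way to write a hybrid objective is sketched below. This is an assumed form for exposition only, not necessarily the exact objective of either framework compared in the paper. The symbols are assumptions: L and U denote the labeled and unlabeled sets, \theta the model parameters, and \alpha \in [0, 1] a hypothetical weight that interpolates between the discriminative and generative terms.

```latex
% Illustrative sketch only (assumed notation, not taken from the paper).
% First term: discriminative (conditional) log-likelihood of labeled data.
% Second term: generative log-likelihood of labeled and unlabeled data,
% where unlabeled inputs contribute through the marginal over labels.
\mathcal{O}(\theta)
  = \alpha \sum_{i \in L} \log p(y_i \mid x_i ; \theta)
  + (1 - \alpha) \Bigl[ \sum_{i \in L} \log p(x_i, y_i ; \theta)
  + \sum_{j \in U} \log \sum_{y} p(x_j, y ; \theta) \Bigr]
```

Under this assumed form, \alpha = 1 recovers purely discriminative training on the labeled data, and \alpha = 0 recovers purely generative semi-supervised training.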
Pages: 280 / +
Page count: 2