Multi-domain learning by confidence-weighted parameter combination

被引:67
作者
Dredze, Mark [1 ]
Kulesza, Alex [2 ]
Crammer, Koby [3 ]
机构
[1] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21211 USA
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[3] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
Online learning; Domain adaptation; Classifier combination; Transfer learning; Multi-task learning; MULTIPLE CLASSIFIERS;
D O I
10.1007/s10994-009-5148-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art statistical NLP systems for a variety of tasks learn from labeled training data that is often domain specific. However, there may be multiple domains or sources of interest on which the system must perform. For example, a spam filtering system must give high quality predictions for many users, each of whom receives emails from different sources and may make slightly different decisions about what is or is not spam. Rather than learning separate models for each domain, we explore systems that learn across multiple domains. We develop a new multi-domain online learning framework based on parameter combination from multiple classifiers. Our algorithms draw from multi-task learning and domain adaptation to adapt multiple source domain classifiers to a new target domain, learn across multiple similar domains, and learn across a large number of disparate domains. We evaluate our algorithms on two popular NLP domain adaptation tasks: sentiment classification and spam filtering.
引用
收藏
页码:123 / 149
页数:27
相关论文
共 44 条
[1]  
ABERNETHY JD, 2007, UCBEECS200720
[2]  
Ando RK, 2005, J MACH LEARN RES, V6, P1817
[3]  
[Anonymous], 1993, COMPUT LINGUIST, DOI DOI 10.21236/ADA273556
[4]  
[Anonymous], 1998, LEARNING LEARN, DOI DOI 10.1007/978-1-4615-5529-2_8
[5]  
ARNOLD A, 2008, ASS COMPUTATIONAL LI
[6]   Task clustering and gating for Bayesian multitask learning [J].
Bakker, B ;
Heskes, T .
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (01) :83-99
[7]  
Ben-David S., 2006, Neural Information Processing Systems, V19
[8]  
BICKEL S, 2007, INT C MACH LEARN ICM
[9]  
Bickel S., 2009, ADV NEURAL INFORM PR, P145
[10]  
Blitzer J., 2007, ASS COMPUTATIONAL LI