Self-Taught Active Learning from Crowds

被引:33
作者
Fang, Meng [1 ]
Zhu, Xingquan [1 ,2 ]
Li, Bin [1 ]
Ding, Wei [3 ]
Wu, Xindong [4 ,5 ]
机构
[1] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Sys, Sydney, NSW 2007, Australia
[2] Florida Atlantic Univ, Dept Comp & Elect Eng, Dept Comp Sci, Boca Raton, FL 33431 USA
[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 01003 USA
[4] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[5] Hefei Univ Technol, Dept Comp Sci, Hefei, Peoples R China
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
基金
美国国家科学基金会; 澳大利亚研究理事会;
关键词
active learning; crowd; self-taught;
D O I
10.1109/ICDM.2012.64
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of social tagging and crowdsourcing systems provides a unique platform where multiple weak labelers can form a crowd to fulfill a labeling task. Yet crowd labelers are often noisy, inaccurate, and have limited labeling knowledge, and worst of all, they act independently without seeking complementary knowledge from each other to improve labeling performance. In this paper, we propose a Self-Taught Active Learning (STAL) paradigm, where imperfect labelers are able to learn complementary knowledge from one another to expand their knowledge sets and benefit the underlying active learner. We employ a probabilistic model to characterize the knowledge of each labeler through which a weak labeler can learn complementary knowledge from a stronger peer. As a result, the self-taught active learning process eventually helps achieve high classification accuracy with minimized labeling costs and labeling errors.
引用
收藏
页码:858 / 863
页数:6
相关论文
共 15 条
[1]  
[Anonymous], 2010, MACH LEARN
[2]  
[Anonymous], 1999, SPRINGER SCI
[3]  
[Anonymous], 2008, P 17 ACM C INFORM KN
[4]  
Dekel O., 2009, Proc. 26th Annu. Int. Conf. Mach. Learn, P233
[5]  
Fang M, 2012, 26 AAAI C ART INT
[6]  
Frank A., 2010, UCI machine learning repository, V213
[7]  
Howe J., 2008, Crowdsourcing: Why the power of the crowd is driving the future of business, V1
[8]   INFORMATION-BASED OBJECTIVE FUNCTIONS FOR ACTIVE DATA SELECTION [J].
MACKAY, DJC .
NEURAL COMPUTATION, 1992, 4 (04) :590-604
[9]  
Rashidi P., 2011, P KDD
[10]  
Raykar VC., 2009, P 26 ANN INT C MACH, P889, DOI [10.1145/1553374.1553488, DOI 10.1145/1553374.1553488]