Crowd labeling latent Dirichlet allocation

被引:0
作者
Luca Pion-Tonachini
Scott Makeig
Ken Kreutz-Delgado
机构
[1] University of California at San Diego,Department of Electrical and Computer Engineering
[2] University of California at San Diego,Swartz Center for Computational Neuroscience
[3] University of California at San Diego,Calit2/QI Pattern Recognition Laboratory
来源
Knowledge and Information Systems | 2017年 / 53卷
关键词
Crowd labeling; Generative model; Bayesian; Latent Dirichlet allocation; EEG;
D O I
暂无
中图分类号
学科分类号
摘要
Large, unlabeled datasets are abundant nowadays, but getting labels for those datasets can be expensive and time-consuming. Crowd labeling is a crowdsourcing approach for gathering such labels from workers whose suggestions are not always accurate. While a variety of algorithms exist for this purpose, we present crowd labeling latent Dirichlet allocation (CL-LDA), a generalization of latent Dirichlet allocation that can solve a more general set of crowd labeling problems. We show that it performs as well as other methods and at times better on a variety of simulated and actual datasets while treating each label as compositional rather than indicating a discrete class. In addition, prior knowledge of workers’ abilities can be incorporated into the model through a structured Bayesian framework. We then apply CL-LDA to the EEG independent component labeling dataset, using its generalizations to further explore the utility of the algorithm. We discuss prospects for creating classifiers from the generated labels.
引用
收藏
页码:749 / 765
页数:16
相关论文
共 10 条
[1]  
Blei DM(2003)Latent Dirichlet allocation J Mach Learn Res 3 993-1022
[2]  
Ng AY(1979)Maximum likelihood estimation of observer error-rates using the EM algorithm Appl Stat 28 20-28
[3]  
Jordan MI(2004)Finding scientific topics Proc Natl Acad Sci 101 5228-5235
[4]  
Dawid AP(2010)Semantic annotation of satellite images using latent Dirichlet allocation IEEE Geosci Remote Sens Lett 7 28-32
[5]  
Skene AM(undefined)undefined undefined undefined undefined-undefined
[6]  
Griffiths TL(undefined)undefined undefined undefined undefined-undefined
[7]  
Steyvers M(undefined)undefined undefined undefined undefined-undefined
[8]  
Lienou M(undefined)undefined undefined undefined undefined-undefined
[9]  
Maître H(undefined)undefined undefined undefined undefined-undefined
[10]  
Datcu M(undefined)undefined undefined undefined undefined-undefined