Labelset topic model for multi-label document classification

被引:9
作者
Li, Ximing [1 ,2 ]
Ouyang, Jihong [1 ,2 ]
Zhou, Xiaotang [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130023, Peoples R China
关键词
Multi-label classification; Topic model; Labelset; Label dependency;
D O I
10.1007/s10844-014-0352-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has recently been suggested that assuming independence between labels is not suitable for real-world multi-label classification. To account for label dependencies, this paper proposes a supervised topic modeling algorithm, namely labelset topic model (LsTM). Our algorithm uses two labelset layers to capture label dependencies. LsTM offers two major advantages over existing supervised topic modeling algorithms: it is straightforward to interpret and it allows words to be assigned to combinations of labels, rather than a single label. We have performed extensive experiments on several well-known multilabel datasets. Experimental results indicate that the proposed model achieves performance on par with and often exceeding that of state-of-the-art methods both qualitatively and quantitatively.
引用
收藏
页码:83 / 97
页数:15
相关论文
共 35 条
[1]  
[Anonymous], NEURAL INFORM PROCES
[2]  
[Anonymous], NEUR INF PROC SYST W
[3]  
[Anonymous], 2001, Lecture Notes in Computer Science
[4]  
[Anonymous], 2006, ICML, DOI DOI 10.1145/1143844.1143917
[5]  
[Anonymous], INT C MUS INF RETR
[6]  
[Anonymous], C COMP NAT LANG LEAR
[7]   A CORRELATED TOPIC MODEL OF SCIENCE [J].
Blei, David M. ;
Lafferty, John D. .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :17-35
[8]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[9]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[10]  
Brinker K, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P702