PruDent: A Pruned and Confident Stacking Approach for Multi-Label Classification

被引:26
作者
Alali, Abdulaziz [1 ]
Kubat, Miroslav [1 ]
机构
[1] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33146 USA
关键词
Multi-label classification; chaining; stacking; label dependence; TEXT-CATEGORIZATION; DECISION TREES; INDUCTION;
D O I
10.1109/TKDE.2015.2416731
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past decade or so, several research groups have addressed the problem of multi-label classification where each example can belong to more than one class at the same time. A common approach, called Binary Relevance (BR), addresses this problem by inducing a separate classifier for each class. Research has shown that this framework can be improved if mutual class dependence is exploited: an example that belongs to class X is likely to belong also to class Y; conversely, belonging to X can make an example less likely to belong to Z. Several works sought to model this information by using the vector of class labels as additional example attributes. To fill the unknown values of these attributes during prediction, existing methods resort to using outputs of other classifiers, and this makes them prone to errors. This is where our paper wants to contribute. We identified two potential ways to prune unnecessary dependencies and to reduce error-propagation in our new classifier-stacking technique, which is named PruDent. Experimental results indicate that the classification performance of PruDent compares favorably with that of other state-of-the-art approaches over a broad range of testbeds. Moreover, its computational costs grow only linearly in the number of classes.
引用
收藏
页码:2480 / 2493
页数:14
相关论文
共 41 条
[1]   Incorporating label dependency into the binary relevance framework for multi-label classification [J].
Alvares-Cherman, Everton ;
Metz, Jean ;
Monard, Maria Carolina .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) :1647-1655
[2]  
[Anonymous], 2010, P ACM SIGKDD
[3]  
[Anonymous], 2001, Lecture Notes in Computer Science
[4]  
[Anonymous], 2005, PROC 14 ACM INT C I, DOI DOI 10.1145/1099554.1099591
[5]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[6]   Exploiting label dependencies for improved sample complexity [J].
Chekina, Lena ;
Gutfreund, Dan ;
Kontorovich, Aryeh ;
Rokach, Lior ;
Shapira, Bracha .
MACHINE LEARNING, 2013, 91 (01) :1-42
[7]  
De Comité F, 2003, LECT NOTES ARTIF INT, V2734, P35
[8]  
Dembczynski K., 2010, PROC INTCONF MACH LE, P279
[9]   An Analysis of Chaining in Multi-Label Classification [J].
Dembczynski, Krzysztof ;
Waegeman, Willem ;
Huellermeir, Eyke .
20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 :294-+
[10]   On label dependence and loss minimization in multi-label classification [J].
Dembczynski, Krzysztof ;
Waegeman, Willem ;
Cheng, Weiwei ;
Huellermeier, Eyke .
MACHINE LEARNING, 2012, 88 (1-2) :5-45