How Is a Data-Driven Approach Better than Random Choice in Label Space Division for Multi-Label Classification?

被引:31
作者
Szymanski, Piotr [1 ,2 ]
Kajdanowicz, Tomasz [1 ]
Kersting, Kristian [3 ]
机构
[1] Wroclaw Univ Technol, Dept Computat Intelligence, Wybreze Stanislawa Wyspianskiego 27, PL-50370 Wroclaw, Poland
[2] Illimites Fdn, Gajowicka 64 Lok 1, PL-53422 Wroclaw, Poland
[3] TU Dortmund Univ, Dept Comp Sci, August Schmidt Str 4, D-44221 Dortmund, Germany
基金
欧盟第七框架计划;
关键词
label space clustering; label co-occurrence; label grouping; multi-label classification; clustering; machine learning; random k-label sets; ensemble classification;
D O I
10.3390/e18080282
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We propose using five data-driven community detection approaches from social networks to partition the label space in the task of multi-label classification as an alternative to random partitioning into equal subsets as performed by RAkELd. We evaluate modularity-maximizing using fast greedy and leading eigenvector approximations, infomap, walktrap and label propagation algorithms. For this purpose, we propose to construct a label co-occurrence graph (both weighted and unweighted versions) based on training data and perform community detection to partition the label set. Then, each partition constitutes a label space for separate multi-label classification sub-problems. As a result, we obtain an ensemble of multi-label classifiers that jointly covers the whole label space. Based on the binary relevance and label powerset classification methods, we compare community detection methods to label space divisions against random baselines on 12 benchmark datasets over five evaluation measures. We discover that data-driven approaches are more efficient and more likely to outperform RAkELd than binary relevance or label powerset is, in every evaluated measure. For all measures, apart from Hamming loss, data-driven approaches are significantly better than RAkELd (alpha = 0.05), and at least one data-driven approach is more likely to outperform RAkELd than a priori methods in the case of RAkELd's best performance. This is the largest RAkELd evaluation published to date with 250 samplings per value for 10 values of RAkELd parameter k on 12 datasets published to date.
引用
收藏
页数:30
相关论文
共 31 条
[1]  
[Anonymous], 2007, INT J DATA WAREHOUSI
[2]  
[Anonymous], 2008, ISMIR
[3]  
[Anonymous], 2013, MLSP
[4]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[5]   On modularity clustering [J].
Brandes, Ulrik ;
Delling, Daniel ;
Gaertler, Marco ;
Goerke, Robert ;
Hoefer, Martin ;
Nikoloski, Zoran ;
Wagner, Dorothea .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (02) :172-188
[6]   Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach [J].
Briggs, Forrest ;
Lakshminarayanan, Balaji ;
Neal, Lawrence ;
Fern, Xiaoli Z. ;
Raich, Raviv ;
Hadley, Sarah J. K. ;
Hadley, Adam S. ;
Betts, Matthew G. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (06) :4640-4650
[7]  
Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
[8]  
Csardi G., 2006, Inter Journal, Complex Systems, P1695
[9]  
Dembczynski K., 2010, PROC INTCONF MACH LE, P279
[10]   On label dependence and loss minimization in multi-label classification [J].
Dembczynski, Krzysztof ;
Waegeman, Willem ;
Cheng, Weiwei ;
Huellermeier, Eyke .
MACHINE LEARNING, 2012, 88 (1-2) :5-45