ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification

被引:73
作者
Wu, Qingyao [1 ]
Tan, Mingkui [1 ]
Song, Hengjie [1 ]
Chen, Jian [1 ]
Ng, Michael K. [2 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510641, Guangdong, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; label dependency; label transfer; tree classifier; ensemble methods;
D O I
10.1109/TKDE.2016.2581161
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification deals with the problem where each example is associated with multiple class labels. Since the labels are often dependent to other labels, exploiting label dependencies can significantly improve the multi-label classification performance. The label dependency in existing studies is often given as prior knowledge or learned from the labels only. However, in many real applications, such prior knowledge may not be available, or labeled information might be very limited. In this paper, we propose a new algorithm, called ML-FOREST, to learn an ensemble of hierarchical multi-label classifier trees to reveal the intrinsic label dependencies. In ML-FOREST, we construct a set of hierarchical trees, and develop a label transfer mechanism to identify the multiple relevant labels in a hierarchical way. In general, the relevant labels at higher levels of the trees capture more discriminable label concepts, and they will be transferred into lower level children nodes that are harder to discriminate. The relevant labels in the hierarchy are then aggregated to compute label dependency and make the final prediction. Our empirical study shows encouraging results of the proposed algorithm in comparison with the state-of-the-art multi-label classification algorithms under Friedman test and post-hoc Nemenyi test.
引用
收藏
页码:2665 / 2680
页数:16
相关论文
共 50 条
[1]  
Agrawal R., 2013, P 22 INT C WORLD WID, P13
[2]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 2001, Lecture Notes in Computer Science
[4]  
[Anonymous], 2010, Advances in Neural Information Processing Systems
[5]  
Bi Wei, 2011, P 28 INT C INT C MAC, P17
[6]  
Bin Fu, 2012, Advances in Knowledge Discovery and Data Mining. Proceedings 16th Pacific-Asia Conference (PAKDD 2012), P159, DOI 10.1007/978-3-642-30217-6_14
[7]  
Blockeel H., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P55
[8]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Matrix Completion for Weakly-Supervised Multi-Label Image Classification [J].
Cabral, Ricardo ;
De la Torre, Fernando ;
Costeira, Joao Paulo ;
Bernardino, Alexandre .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) :121-135