Bayes-Optimal Hierarchical Multilabel Classification

被引:30
作者
Bi, Wei [1 ]
Kwok, Jame T. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
关键词
Hierarchical classification; multilabel classification; loss function; Bayesian decision theory; LABEL;
D O I
10.1109/TKDE.2015.2441707
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical multilabel classification allows a sample to belong to multiple class labels residing on a hierarchy, which can be a tree or directed acyclic graph (DAG). However, popular hierarchical loss functions, such as the H-loss, can only be defined on tree hierarchies (but not on DAGs), and may also under-or over-penalize misclassifications near the bottom of the hierarchy. Besides, it has been relatively unexplored on how to make use of the loss functions in hierarchical multilabel classification. To overcome these deficiencies, we first propose hierarchical extensions of the Hamming loss and ranking loss which take the mistake at every node of the label hierarchy into consideration. Then, we first train a general learning model, which is independent of the loss function. Next, using Bayesian decision theory, we develop Bayes-optimal predictions that minimize the corresponding risks with the trained model. Computationally, instead of requiring an exhaustive summation and search for the optimal multilabel, the resultant optimization problem can be efficiently solved by a greedy algorithm. Experimental results on a number of real-world data sets show that the proposed Bayes-optimal classifier outperforms state-of-the-art methods.
引用
收藏
页码:2907 / 2918
页数:12
相关论文
共 45 条
[1]  
[Anonymous], 2012, ADV NEURAL INFORM PR
[2]  
[Anonymous], 2011, P 28 INT C MACHINE L
[3]  
[Anonymous], 1999, PROBABILISTIC OUTPUT
[4]  
[Anonymous], 2013, Proceedings of the International Conference on Machine Learning
[5]  
[Anonymous], 2010, Advances in Neural Information Processing Systems
[6]  
[Anonymous], 2005, P SPEC INT TRACKS PO
[7]  
[Anonymous], 2006, Proceedings of the 23rd International Conference on Machine Learning
[8]  
[Anonymous], 2000, Pattern Classification
[9]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836
[10]  
Bi W., 2011, P 28 INT C MACH LEAR, P17, DOI [10.5555/3104482.3104485, DOI 10.5555/3104482.3104485]