Cost-sensitive label embedding for multi-label classification

被引:51
作者
Huang, Kuan-Hao [1 ]
Lin, Hsuan-Tien [1 ]
机构
[1] Natl Taiwan Univ, CSIE Dept, Taipei, Taiwan
关键词
Multi-label classification; Cost-sensitive; Label embedding; ANNOTATION;
D O I
10.1007/s10994-017-5659-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Label embedding (LE) is an important family of multi-label classification algorithms that digest the label information jointly for better performance. Different real-world applications evaluate performance by different cost functions of interest. Current LE algorithms often aim to optimize one specific cost function, but they can suffer from bad performance with respect to other cost functions. In this paper, we resolve the performance issue by proposing a novel cost-sensitive LE algorithm that takes the cost function of interest into account. The proposed algorithm, cost-sensitive label embedding with multidimensional scaling (CLEMS), approximates the cost information with the distances of the embedded vectors by using the classic multidimensional scaling approach for manifold learning. CLEMS is able to deal with both symmetric and asymmetric cost functions, and effectively makes cost-sensitive decisions by nearest-neighbor decoding within the embedded vectors. We derive theoretical results that justify how CLEMS achieves the desired cost-sensitivity. Furthermore, extensive experimental results demonstrate that CLEMS is significantly better than a wide spectrum of existing LE algorithms and state-of-the-art cost-sensitive algorithms across different cost functions.
引用
收藏
页码:1725 / 1746
页数:22
相关论文
共 34 条
[1]  
[Anonymous], 2011, P ADV NEUR INF PROC
[2]  
[Anonymous], 2009, Proc. of Neural Information Processing Systems
[3]  
[Anonymous], 2012, ARXIV PREPRINT ARXIV
[4]  
[Anonymous], 2014, INT C MACH LEARN
[5]  
[Anonymous], 2007, INT J DATA WAREHOUSI
[6]  
[Anonymous], 2011, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics
[7]  
[Anonymous], 2013, P 30 INT C INT C MAC
[8]  
[Anonymous], 2008, ISMIR
[9]  
[Anonymous], J MACH LEARN RES
[10]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836