Robust label compression for multi-label classification

被引:5
作者
Zhang, Ju-Jie [1 ]
Fang, Min [1 ]
Wu, Jin-Qiao [1 ]
Li, Xiao [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, 2 South Taibai Rd, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; Label compression; Encoding loss; Dependence loss; Outliers; l(2,1)-norm;
D O I
10.1016/j.knosys.2016.05.051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Label compression (LC) is an effective strategy to reduce time cost and improve classification performance simultaneously for multi-label classification. One main limitation of existing LC methods is that they are prone to outliers. Here outliers include outliers in the feature space and outliers in the label space. Outliers in the feature space are obtained due to data acquisition devices. Outliers in the label space refer to label vectors that are inconsistent with the regular label correlations. In this paper, we propose a new LC method, termed robust label compression (RLC), based on l(2,1)-norm to deal with outliers in the feature space and label space. The objective function of RLC consists of two losses: the encoding loss to measure the compression error and the dependence loss to measure the relevance between the instances and the obtained code vectors after compressing the label vectors. To achieve robustness to outliers, we utilize the l(2,1)-norm on both losses. We propose an efficient optimization algorithm for it and present theoretical analysis. Experiments across six data sets validate the superiority of our proposed method to state-of-art LC methods for multi-label classification. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:32 / 42
页数:11
相关论文
共 39 条
[1]  
[Anonymous], 2002, Principal components analysis
[2]  
[Anonymous], 2009, Proc. of Neural Information Processing Systems
[3]  
[Anonymous], 2010, P ACM SIGKDD
[4]  
[Anonymous], 2007, INT J DATA WAREHOUSI
[5]  
[Anonymous], 2008, ISMIR
[6]  
[Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
[7]  
[Anonymous], P 2012 INT C MULT RE
[8]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[9]  
Cao L., 2015, PROJECT NO 11 3004, P1
[10]   Training ν-support vector regression:: Theory and algorithms [J].
Chang, CC ;
Lin, CJ .
NEURAL COMPUTATION, 2002, 14 (08) :1959-1977