Leveraging Label-Specific Discriminant Mapping Features for Multi-Label Learning

被引:31
作者
Guo, Yumeng [1 ,2 ]
Chung, Fulai [2 ]
Li, Guozheng [1 ,3 ]
Wang, Jiancong [4 ]
Gee, James C. [4 ]
机构
[1] Tongji Univ, Dept Control Sci & Engn, Shanghai, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[3] China Acad Chinese Med Sci, Data Ctr Tradit Chinese Med, Beijing, Peoples R China
[4] Univ Penn, Dept Radiol, Penn Image Comp & Sci Lab, Philadelphia, PA 19104 USA
基金
国家重点研发计划;
关键词
Machine learning; multi-label learning; label specific features; CLASSIFICATION; SYSTEM;
D O I
10.1145/3319911
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important machine learning task, multi-label learning deals with the problem where each sample instance (feature vector) is associated with multiple labels simultaneously. Most existing approaches focus on manipulating the label space, such as exploiting correlations between labels and reducing label space dimension, with identical feature space in the process of classification. One potential drawback of this traditional strategy is that each label might have its own specific characteristics and using identical features for all label cannot lead to optimized performance. In this article, we propose an effective algorithm named LSDM, i.e., leveraging label-specific discriminant mapping features for multi-label learning, to overcome the drawback. LSDM sets diverse ratio parameter values to conduct cluster analysis on the positive and negative instances of identical label. It reconstructs label-specific feature space which includes distance information and spatial topology information. Our experimental results show that combining these two parts of information in the new feature representation can better exploit the clustering results in the learning process. Due to the problem of diverse combinations for identical label, we employ simplified linear discriminant analysis to efficiently excavate optimal one for each label and perform classification by querying the corresponding results. Comparison with the state-of-the-art algorithms on a total of 20 benchmark datasets clearly manifests the competitiveness of LSDM.
引用
收藏
页数:23
相关论文
共 44 条
  • [1] [Anonymous], LNCS
  • [2] [Anonymous], P EUR C MACH LEARN P
  • [3] Hierarchical multi-label prediction of gene function
    Barutcuoglu, Z
    Schapire, RE
    Troyanskaya, OG
    [J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
  • [4] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [5] Cabral R. S., 2011, Advances in neural information processing systems, V201, P2
  • [6] Matrix Completion for Weakly-Supervised Multi-Label Image Classification
    Cabral, Ricardo
    De la Torre, Fernando
    Costeira, Joao Paulo
    Bernardino, Alexandre
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) : 121 - 135
  • [7] Synergy of multi-label hierarchical ensembles, data fusion, and cost-sensitive methods for gene functional inference
    Cesa-Bianchi, Nicolo
    Re, Matteo
    Valentini, Giorgio
    [J]. MACHINE LEARNING, 2012, 88 (1-2) : 209 - 241
  • [8] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [9] Combining instance-based learning and logistic regression for multilabel classification
    Cheng, Weiwei
    Huellermeier, Eyke
    [J]. MACHINE LEARNING, 2009, 76 (2-3) : 211 - 225
  • [10] Clare A., 2001, Lecture Notes in Computer Science, P42