Feature Selection for Multi-label Classification Using Neighborhood Preservation

被引:33
作者
Cai, Zhiling [1 ]
Zhu, William [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Lab Granular Comp & AI, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; multi-label learning; neighborhood relationship preserving; sample similarity; TRANSFORMATION; REDUCTION;
D O I
10.1109/JAS.2017.7510781
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with data associated with a set of labels simultaneously. Dimensionality reduction is an important but challenging task in multi-label learning. Feature selection is an efficient technique for dimensionality reduction to search an optimal feature subset preserving the most relevant information. In this paper, we propose an effective feature evaluation criterion for multi-label feature selection, called neighborhood relationship preserving score. This criterion is inspired by similarity preservation, which is widely used in single-label feature selection. It evaluates each feature subset by measuring its capability in preserving neighborhood relationship among samples. Unlike similarity preservation, we address the order of sample similarities which can well express the neighborhood relationship among samples, not just the pairwise sample similarity. With this criterion, we also design one ranking algorithm and one greedy algorithm for feature selection problem. The proposed algorithms are validated in six publicly available data sets from machine learning repository. Experimental results demonstrate their superiorities over the compared state-of-the-art methods.
引用
收藏
页码:320 / 330
页数:11
相关论文
共 59 条
  • [1] [Anonymous], AMSTER658
  • [2] [Anonymous], 2002, Principal components analysis
  • [3] [Anonymous], 1997, ICML
  • [4] [Anonymous], MULTIVARIATE ANAL
  • [5] [Anonymous], 2002, INT C NEUR INF PROC, DOI DOI 10.5555/2968618.2968710
  • [6] [Anonymous], 2008, ISMIR
  • [7] Hierarchical multi-label prediction of gene function
    Barutcuoglu, Z
    Schapire, RE
    Troyanskaya, OG
    [J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
  • [8] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [9] Chang XJ, 2014, AAAI CONF ARTIF INTE, P1171
  • [10] Document transformation for multi-label feature selection in text categorization
    Chen, Weizhu
    Yan, Jun
    Zhang, Benyu
    Chen, Zheng
    Yang, Qiang
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 451 - +