Integrating Global and Local Feature Selection for Multi-Label Learning

被引:7
作者
Zhang, Zan [1 ,2 ]
Liu, Lin [3 ]
Li, Jiuyong [3 ]
Wu, Xindong [4 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Intelligent Interconnected Syst Lab Anhui Prov, Minist Educ China, Hefei 230601, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Anhui, Peoples R China
[3] Univ South Australia, UniSA STEM, Adelaide, SA 5095, Australia
[4] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Minist Educ China, Hefei 230601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label learning; label correlations; Local Feature Selection; DISCRIMINATIVE FEATURE-SELECTION; CLASSIFICATION;
D O I
10.1145/3532190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with the problem where an instance is associated with multiple labels simultaneously. Multi-label data is often of high dimensionality and has many noisy, irrelevant, and redundant features. As an important machine learning task, multi-label feature selection has received considerable attention in recent years due to its promising performance in dealing with high-dimensional multi-label data. Existing multi-label feature selection methods typically select the global features which are shared by all instances in a dataset. However, these multi-label feature selection methods may be suboptimal since they do not consider the specific characteristics of instances. In this paper, we propose a novel algorithm that integrates Global and Local Feature Selection (GLFS) to exploit both the global features and a subset of discriminative features shared only locally by a subgroup of instances in a multi-label dataset. Specifically, GLFS employs linear regression and l(2,1)-norm on the regression parameters to achieve simultaneous global and local feature selection. Moreover, the proposed algorithm has an effective mechanism for utilizing label correlations to improve the feature selection. Experiments on real-world multi-label datasets show the superiority of GLFS over the state-of-the-art multi-label feature selection methods.
引用
收藏
页数:37
相关论文
共 62 条
  • [1] ALGORITHM - SOLUTION OF MATRIX EQUATION AX+XB = C
    BARTELS, RH
    STEWART, GW
    [J]. COMMUNICATIONS OF THE ACM, 1972, 15 (09) : 820 - &
  • [2] Hierarchical multi-label prediction of gene function
    Barutcuoglu, Z
    Schapire, RE
    Troyanskaya, OG
    [J]. BIOINFORMATICS, 2006, 22 (07) : 830 - 836
  • [3] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [4] Instance Annotation for Multi-Instance Multi-Label Learning
    Briggs, Forrest
    Fern, Xiaoli Z.
    Raich, Raviv
    Lou, Qi
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (03)
  • [5] Cabral R., 2011, NIPS, P190
  • [6] Multi-label feature selection via feature manifold learning and sparsity regularization
    Cai, Zhiling
    Zhu, William
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (08) : 1321 - 1334
  • [7] Synergy of multi-label hierarchical ensembles, data fusion, and cost-sensitive methods for gene functional inference
    Cesa-Bianchi, Nicolo
    Re, Matteo
    Valentini, Giorgio
    [J]. MACHINE LEARNING, 2012, 88 (1-2) : 209 - 241
  • [8] A novel approach for learning label correlation with application to feature selection of multi-label data
    Che, Xiaoya
    Chen, Degang
    Mi, Jusheng
    [J]. INFORMATION SCIENCES, 2020, 512 (512) : 795 - 812
  • [9] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [10] MEAN SHIFT, MODE SEEKING, AND CLUSTERING
    CHENG, YZ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) : 790 - 799