Multi-Label Feature Selection using Correlation Information

被引:44
|
作者
Braytee, Ali [1 ]
Liu, Wei [3 ]
Catchpoole, Daniel R. [2 ]
Kennedy, Paul J. [1 ]
机构
[1] Univ Technol Sydney, Ctr Artificial Intelligence, Sydney, NSW, Australia
[2] Childrens Hosp Westmead, Tumour Bank, Childrens Canc Res Unit, Westmead, NSW, Australia
[3] Univ Technol Sydney, Adv Analyt Inst, Sydney, NSW, Australia
来源
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2017年
关键词
Multi-label feature selection; Multi-label classification; High dimensional data; New application;
D O I
10.1145/3132847.3132858
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-dimensional multi-labeled data contain instances, where each instance is associated with a set of class labels and has a large number of noisy and irrelevant features. Feature selection has been shown to have great benefits in improving the classification performance in machine learning. In multi-label learning, to select the discriminative features among multiple labels, several challenges should be considered: interdependent labels, different instances may share different label correlations, correlated features, and missing and flawed labels. This work is part of a project at The Children's Hospital at Westmead (TB-CHW), Australia to explore the genomics of childhood leukaemia. In this paper, we propose a CMFS (Correlated- and Multi-label Feature Selection method), based on non-negative matrix factorization (NMF) for simultaneously performing feature selection and addressing the aforementioned challenges. Significantly, a major advantage of our research is to exploit the correlation information contained in features, labels and instances to select the relevant features among multiple labels. Furthermore, /2,1-norm regularization is incorporated in the objective function to undertake feature selection by imposing sparsity on the feature matrix rows. We employ CMFS to decompose the data and multi-label matrices into a low-dimensional space. To solve the objective function, an efficient iterative optimization algorithm is proposed with guaranteed convergence. Finally, extensive experiments are conducted on high-dimensional multi-labeled datasets. The experimental results demonstrate that our method significantly outperforms state-of-the-art multi-label feature selection methods.
引用
收藏
页码:1649 / 1656
页数:8
相关论文
共 50 条
  • [1] Learning correlation information for multi-label feature selection
    Fan, Yuling
    Liu, Jinghua
    Tang, Jianeng
    Liu, Peizhong
    Lin, Yaojin
    Du, Yongzhao
    PATTERN RECOGNITION, 2024, 145
  • [2] Multi-label feature selection based on correlation label enhancement
    He, Zhuoxin
    Lin, Yaojin
    Wang, Chenxi
    Guo, Lei
    Ding, Weiping
    INFORMATION SCIENCES, 2023, 647
  • [3] Multi-label feature selection with global and local label correlation
    Faraji, Mohammad
    Seyedi, Seyed Amjad
    Tab, Fardin Akhlaghian
    Mahmoodi, Reza
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [4] Online Multi-Label Streaming Feature Selection With Label Correlation
    You, Dianlong
    Wang, Yang
    Xiao, Jiawei
    Lin, Yaojin
    Pan, Maosheng
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) : 2901 - 2915
  • [5] Label relaxation and shared information for multi-label feature selection
    Fan, Yuling
    Chen, Xu
    Luo, Shimu
    Liu, Peizhong
    Liu, Jinghua
    Chen, Baihua
    Tang, Jianeng
    INFORMATION SCIENCES, 2024, 671
  • [6] Fuzzy information gain ratio-based multi-label feature selection with label correlation
    Yu, Ying
    Lv, Meiyue
    Qian, Jin
    Lv, Jingqin
    Miao, Duoqian
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2737 - 2747
  • [7] Feature Selection for Multi-label Learning Using Mutual Information and GA
    Yu, Ying
    Wang, Yinglong
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 454 - 463
  • [8] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
  • [9] Multi-Label Feature Selection with Conditional Mutual Information
    Wang, Xiujuan
    Zhou, Yuchen
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] Approximating mutual information for multi-label feature selection
    Lee, J.
    Lim, H.
    Kim, D. -W.
    ELECTRONICS LETTERS, 2012, 48 (15) : 929 - 930