Feature selection for hierarchical classification via joint semantic and structural information of labels

被引:24
|
作者
Huang, Hai [1 ,2 ]
Liu, Huan [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing 100876, Peoples R China
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
基金
中国国家自然科学基金;
关键词
Feature selection; Hierarchical classification; Label semantic similarity; Label hierarchical structure; PREDICTION; ANNOTATION; RELIEFF; GRAPH;
D O I
10.1016/j.knosys.2020.105655
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical Classification is widely used in many real-world applications, where the label space is exhibited as a tree or a Directed Acyclic Graph (DAG) and each label has rich semantic descriptions. Feature selection, as a type of dimension reduction technique, has proven to be effective in improving the performance of machine learning algorithms. However, many existing feature selection methods cannot be directly applied to hierarchical classification problems since they ignore the hierarchical relations and take no advantage of the semantic information in the label space. In this paper, we propose a novel feature selection framework based on semantic and structural information of labels. First, we transform the label description into a mathematical representation and calculate the similarity score between labels as the semantic regularization. Second, we investigate the hierarchical relations in a tree structure of the label space as the structural regularization. Finally, we impose two regularization terms on a sparse learning based model for feature selection. Additionally, we adapt the proposed model to a DAG case, which makes our method more general and robust in many real-world tasks. Experimental results on real-world datasets demonstrate the effectiveness of the proposed framework for hierarchical classification domains. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Semantic-gap-oriented feature selection in hierarchical classification learning
    Liu, Haoyang
    Lin, Yaojin
    Wang, Chenxi
    Guo, Lei
    Chen, Jinkun
    INFORMATION SCIENCES, 2023, 642
  • [2] Hierarchical Classification and Regression with Feature Selection
    Ke, Shih-Wen
    Yeh, Chi-Wei
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 1150 - 1154
  • [3] Assessing university enrollment and admission efforts via hierarchical classification and feature selection
    Maldonado, Sebastian
    Armelini, Guillermo
    Angelo Guevara, C.
    INTELLIGENT DATA ANALYSIS, 2017, 21 (04) : 945 - 962
  • [4] Semantic similarity-aware feature selection and redundancy removal for text classification using joint mutual information
    Lazhar, Farek
    Amira, Benaidja
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (10) : 6187 - 6212
  • [5] Joint multi-label classification and label correlations with missing labels and feature selection
    He, Zhi-Fen
    Yang, Ming
    Gao, Yang
    Liu, Hui-Dong
    Yin, Yilong
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 145 - 158
  • [6] Feature selection via normative fuzzy information weight with application into tumor classification
    Dai, Jianhua
    Chen, Jiaolong
    APPLIED SOFT COMPUTING, 2020, 92
  • [7] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208
  • [8] A Novel Hybrid Feature Selection Algorithm for Hierarchical Classification
    Lima, Helen C. S. C.
    Otero, Fernando E. B.
    Merschmann, Luiz H. C.
    Souza, Marcone J. F.
    IEEE ACCESS, 2021, 9 : 127278 - 127292
  • [9] Joint imbalanced classification and feature selection for hospital readmissions
    Du, Guodong
    Zhang, Jia
    Luo, Zhiming
    Ma, Fenglong
    Ma, Lei
    Li, Shaozi
    KNOWLEDGE-BASED SYSTEMS, 2020, 200
  • [10] A Recursive Regularization Based Feature Selection Framework for Hierarchical Classification
    Zhao, Hong
    Hu, Qinghua
    Zhu, Pengfei
    Wang, Yu
    Wang, Ping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (07) : 2833 - 2846