Feature selection for hierarchical classification via joint semantic and structural information of labels

被引:24
|
作者
Huang, Hai [1 ,2 ]
Liu, Huan [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing 100876, Peoples R China
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
基金
中国国家自然科学基金;
关键词
Feature selection; Hierarchical classification; Label semantic similarity; Label hierarchical structure; PREDICTION; ANNOTATION; RELIEFF; GRAPH;
D O I
10.1016/j.knosys.2020.105655
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical Classification is widely used in many real-world applications, where the label space is exhibited as a tree or a Directed Acyclic Graph (DAG) and each label has rich semantic descriptions. Feature selection, as a type of dimension reduction technique, has proven to be effective in improving the performance of machine learning algorithms. However, many existing feature selection methods cannot be directly applied to hierarchical classification problems since they ignore the hierarchical relations and take no advantage of the semantic information in the label space. In this paper, we propose a novel feature selection framework based on semantic and structural information of labels. First, we transform the label description into a mathematical representation and calculate the similarity score between labels as the semantic regularization. Second, we investigate the hierarchical relations in a tree structure of the label space as the structural regularization. Finally, we impose two regularization terms on a sparse learning based model for feature selection. Additionally, we adapt the proposed model to a DAG case, which makes our method more general and robust in many real-world tasks. Experimental results on real-world datasets demonstrate the effectiveness of the proposed framework for hierarchical classification domains. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Embedding Feature Selection for Large-scale Hierarchical Classification
    Naik, Azad
    Rangwala, Huzefa
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1212 - 1221
  • [32] Incremental feature selection for large-scale hierarchical classification with the arrival of new samples
    Yang Tian
    Yanhong She
    Applied Intelligence, 2024, 54 : 3933 - 3953
  • [33] Fuzzy Rough Set Based Feature Selection for Large-Scale Hierarchical Classification
    Zhao, Hong
    Wang, Ping
    Hu, Qinghua
    Zhu, Pengfei
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1891 - 1903
  • [34] Incremental feature selection for large-scale hierarchical classification with the arrival of new samples
    Tian, Yang
    She, Yanhong
    APPLIED INTELLIGENCE, 2024, 54 (05) : 3933 - 3953
  • [35] Feature Selection via Vectorizing Feature's Discriminative Information
    Wang, Jun
    Xu, Hengpeng
    Wei, Jinmao
    WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 493 - 505
  • [36] Hierarchical Feature Selection Algorithm Combined with Category Information Constraints
    Zhang, Zhihui
    2024 CROSS STRAIT RADIO SCIENCE AND WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC 2024, 2024, : 108 - 110
  • [37] Joint Semantic Feature Selection and Bandwidth Allocation for Image Transmission Optimization
    Chen, Chen
    Li, Yang
    Feng, Li
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [38] Boosting feature selection using information metric for classification
    Liu, Huawen
    Liu, Lei
    Zhang, Huijie
    NEUROCOMPUTING, 2009, 73 (1-3) : 295 - 303
  • [39] Textural feature selection by joint mutual information based on Gaussian mixture model for multispectral image classification
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    Aboutajdine, Driss
    PATTERN RECOGNITION LETTERS, 2010, 31 (10) : 1168 - 1174
  • [40] Feature Selection based on Information Theory for Pattern Classification
    Krishna, R. Sathya Bama
    Aramudhan, M.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1233 - 1236