Hierarchical feature selection with multi-granularity clustering structure

被引:21
作者
Guo, Shunxin [1 ,3 ]
Zhao, Hong [1 ,2 ]
Yang, Wenyuan [3 ]
机构
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Fujian, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Fujian, Peoples R China
[3] Minnan Normal Univ, Fujian Key Lab Granular Comp & Applicat, Zhangzhou 363000, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Granular computing; Hierarchical feature selection; Multi-granularity clustering; Semantic gap; CLASSIFICATION; ALGORITHM; DATABASE;
D O I
10.1016/j.ins.2021.04.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hierarchical feature selection addresses the issues caused by the presence of high-dimensional features in multi-category classification systems with hierarchical structures. Granular calculations are made to analyze the hierarchical relationships among categories when selecting the optimal feature subset. However, semantic hierarchy-based feature selection methods are prone to the semantic gap problem, which affects classification accuracy. In this paper, we propose a hierarchical feature selection method with a multi-granularity clustering structure that can effectively alleviate the semantic gap problem. Firstly, a hierarchical structure is constructed via bottom-up multi-granularity clustering based on feature similarities rather than semantic categories. This clustering hierarchy is conducive to solving semantic gap problems in the existing hierarchy. Secondly, the optimal feature subset is selected using the l(1,2)-norms in each hierarchy's granularity layer. This joint minimization approach can retain both the granularity layers' shared features and granularity-specific features. Finally, we execute hierarchical classification according to the granular structure in a coarse to fine sequence. Extensive experiments demonstrate that the proposed method outperforms several state-of-the-art hierarchical feature selection approaches. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:448 / 462
页数:15
相关论文
共 48 条
[21]   3D Object Representations for Fine-Grained Categorization [J].
Krause, Jonathan ;
Stark, Michael ;
Deng, Jia ;
Li Fei-Fei .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561
[22]  
Krizhevsky A., 2009, Citeseer, Tech. Rep.
[23]   Protein Folds Prediction with Hierarchical Structured SVM [J].
Li, Dapeng ;
Ju, Ying ;
Zou, Quan .
CURRENT PROTEOMICS, 2016, 13 (02) :79-85
[24]   An efficient rough feature selection algorithm with a multi-granulation view [J].
Liang, Jiye ;
Wang, Feng ;
Dang, Chuangyin ;
Qian, Yuhua .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2012, 53 (06) :912-926
[25]   Multi-granularity feature selection on cost-sensitive data with measurement errors and variable costs [J].
Liao, Shujiao ;
Zhu, Qingxin ;
Qian, Yuhua ;
Lin, Guoping .
KNOWLEDGE-BASED SYSTEMS, 2018, 158 :25-42
[26]  
Liu J., 2013, SLEP SPARSE LEARNING, P1
[27]   Topic-Based Algorithm for Multilabel Learning With Missing Labels [J].
Ma, Jianghong ;
Chow, Tommy W. S. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (07) :2138-2152
[28]   A Bayesian generative model for learning semantic hierarchies [J].
Mittelman, Roni ;
Sun, Min ;
Kuipers, Benjamin ;
Savarese, Silvio .
FRONTIERS IN PSYCHOLOGY, 2014, 5
[29]   SCOP - A STRUCTURAL CLASSIFICATION OF PROTEINS DATABASE FOR THE INVESTIGATION OF SEQUENCES AND STRUCTURES [J].
MURZIN, AG ;
BRENNER, SE ;
HUBBARD, T ;
CHOTHIA, C .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 247 (04) :536-540
[30]  
Nie F., 2010, NEURAL INF PROCESS S, P1813, DOI DOI 10.5555/2997046.2997098