Hierarchical feature selection with multi-granularity clustering structure

Cited by: 21
Authors
Guo, Shunxin [1, 3]
Zhao, Hong [1, 2]
Yang, Wenyuan [3]
Affiliations
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Fujian, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Fujian, Peoples R China
[3] Minnan Normal Univ, Fujian Key Lab Granular Comp & Applicat, Zhangzhou 363000, Fujian, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Granular computing; Hierarchical feature selection; Multi-granularity clustering; Semantic gap; CLASSIFICATION; ALGORITHM; DATABASE;
DOI
10.1016/j.ins.2021.04.046
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Hierarchical feature selection addresses the issues caused by high-dimensional features in multi-category classification systems with hierarchical structures. Granular computing is used to analyze the hierarchical relationships among categories when selecting the optimal feature subset. However, feature selection methods based on a semantic hierarchy are prone to the semantic gap problem, which reduces classification accuracy. In this paper, we propose a hierarchical feature selection method with a multi-granularity clustering structure that can effectively alleviate the semantic gap problem. First, a hierarchical structure is constructed via bottom-up multi-granularity clustering based on feature similarities rather than semantic categories. This clustering hierarchy helps to resolve the semantic gap problems of the existing hierarchy. Second, the optimal feature subset is selected using the ℓ_{1,2}-norms in each granularity layer of the hierarchy. This joint minimization approach can retain both the features shared across granularity layers and the granularity-specific features. Finally, we execute hierarchical classification according to the granular structure in a coarse-to-fine sequence. Extensive experiments demonstrate that the proposed method outperforms several state-of-the-art hierarchical feature selection approaches. © 2021 Elsevier Inc. All rights reserved.
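The first step of the abstract (building a class hierarchy bottom-up from feature similarity instead of semantic labels) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: classes are represented by their mean feature vectors, similarity is plain Euclidean distance, and the two closest groups are merged greedily until a chosen number of coarse groups remains. The function names `class_centroids` and `merge_classes` and the toy data are hypothetical.

```python
import numpy as np

def class_centroids(X, y):
    """Mean feature vector of each class, in sorted label order."""
    labels = np.unique(y)
    return labels, np.stack([X[y == c].mean(axis=0) for c in labels])

def merge_classes(X, y, n_coarse):
    """Greedily merge the two closest class groups until n_coarse remain.

    Assumption: groups are compared by the Euclidean distance between
    their size-weighted centroids; the paper may use another measure.
    """
    labels, cents = class_centroids(X, y)
    groups = [[c] for c in labels.tolist()]
    cents = [c for c in cents]
    sizes = [int((y == c).sum()) for c in labels]
    while len(groups) > n_coarse:
        best = None
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                d = np.linalg.norm(cents[i] - cents[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        total = sizes[i] + sizes[j]
        # Merge group j into group i and update the weighted centroid.
        cents[i] = (sizes[i] * cents[i] + sizes[j] * cents[j]) / total
        sizes[i] = total
        groups[i] = groups[i] + groups[j]
        del groups[j], cents[j], sizes[j]
    return groups

# Toy data: four fine-grained classes lying on a line in feature space.
X = np.array([[0.0, 0.0], [0.2, 0.0], [1.0, 0.0], [1.2, 0.0],
              [10.0, 0.0], [10.2, 0.0], [12.0, 0.0], [12.2, 0.0]])
y = np.array([0, 0, 1, 1, 2, 2, 3, 3])

coarse = merge_classes(X, y, n_coarse=2)
print(coarse)  # [[0, 1], [2, 3]]
```

Each returned group acts as one coarse-grained node whose children are the original fine-grained classes. Under the same assumptions, calling `merge_classes` with several values of `n_coarse` would give several granularity layers, which is where per-layer feature selection and coarse-to-fine classification would then operate.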
Pages: 448-462
Page count: 15