Student-t kernelized fuzzy rough set model with fuzzy divergence for feature selection

被引:21
作者
Yang, Xiaoling [1 ,2 ]
Chen, Hongmei [1 ,2 ]
Li, Tianrui [1 ,2 ]
Zhang, Pengfei [1 ,2 ]
Luo, Chuan [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu, Peoples R China
[3] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
基金
美国国家科学基金会;
关键词
Feature selection; Fuzzy rough set; Fuzzy relation; Student; t kernel; Fuzzy divergence; ATTRIBUTE REDUCTION; MUTUAL INFORMATION; DEPENDENCY; RELEVANCE; ENTROPY;
D O I
10.1016/j.ins.2022.07.139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fuzzy rough set theory can tackle feature redundancy in data and select more informative features for machine learning tasks. Gaussian kernel is often coupled with fuzzy rough set theory to measure fuzzy relation between data instances. However, Gaussian kernel has a serious long-tail phenomenon, which would perform poorly in modeling the fuzzy relation for high-dimensional data. Moreover, a robust feature evaluation function is also nontrivial in a fuzzy rough set model because a naive model may select those non-optimal feature subsets due to the perturbations from redundant features. This paper delves into Student -t kernel and fuzzy divergence to address these challenges for fuzzy rough feature selection. This paper proposes a new Student -t Kernelized Fuzzy Rough Set (SKFRS) model. The new model uses fuzzy divergence to evaluate uncertain information in the data. It also explores a newly-defined feature evaluation function on the biases of the dynamic relation between the relevance and indispensability of features in feature selection process. A novel forward greedy search algorithm is then presented to solve the final objective function. The selected features are subsequently evaluated on downstream classification tasks. Experimental results using real-world datasets demonstrate the effectiveness of the pro-posed model and its superiority against the baseline methods.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:52 / 72
页数:21
相关论文
共 50 条
[1]   Data-Distribution-Aware Fuzzy Rough Set Model and its Application to Robust Classification [J].
An, Shuang ;
Hu, Qinghua ;
Pedrycz, Witold ;
Zhu, Pengfei ;
Tsang, Eric C. C. .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (12) :3073-3085
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]  
Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
[4]   Parameterized attribute reduction with Gaussian kernel based fuzzy rough sets [J].
Chen, Degang ;
Hu, Qinghua ;
Yang, Yongping .
INFORMATION SCIENCES, 2011, 181 (23) :5169-5179
[5]   A graph approach for fuzzy -rough feature selection [J].
Chen, Jinkun ;
Mi, Jusheng ;
Lin, Yaojin .
FUZZY SETS AND SYSTEMS, 2020, 391 :96-116
[6]   Cross-entropy measure of uncertain variables [J].
Chen, Xiaowei ;
Kar, Samarjit ;
Ralescu, Dan A. .
INFORMATION SCIENCES, 2012, 201 :53-60
[7]   A novel fuzzy rule extraction approach using Gaussian kernel-based granular computing [J].
Dai, Guangyao ;
Hu, Yi ;
Yang, Yu ;
Zhang, Nanxun ;
Abraham, Ajith ;
Liu, Hongbo .
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (02) :821-846
[8]   Feature selection via normative fuzzy information weight with application into tumor classification [J].
Dai, Jianhua ;
Chen, Jiaolong .
APPLIED SOFT COMPUTING, 2020, 92
[9]   Maximal-Discernibility-Pair-Based Approach to Attribute Reduction in Fuzzy Rough Sets [J].
Dai, Jianhua ;
Hu, Hu ;
Wu, Wei-Zhi ;
Qian, Yuhua ;
Huang, Debiao .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) :2174-2187
[10]   Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification [J].
Dai, Jianhua ;
Xu, Qing .
APPLIED SOFT COMPUTING, 2013, 13 (01) :211-221