Neighborhood rough sets with distance metric learning for feature selection

被引:68
作者
Yang, Xiaoling [1 ,2 ]
Chen, Hongmei [1 ,2 ]
Li, Tianrui [1 ,2 ]
Wan, Jihong [1 ,2 ]
Sang, Binbin [1 ,2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu, Sichuan, Peoples R China
基金
美国国家科学基金会;
关键词
Neighborhood rough sets; Neighborhood relation; Distance metric learning; Feature selection; STREAMING FEATURE-SELECTION; ATTRIBUTE REDUCTION; UNCERTAINTY MEASURES; CLASSIFICATION; OPTIMIZATION; MODEL;
D O I
10.1016/j.knosys.2021.107076
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neighborhood rough set is a useful mathematic tool to describe uncertainty in mixed data. Feature selection based on neighborhood rough set has been studied widely. However, most existing methods use a single predefined distance function to construct neighborhood granules. As not all datasets are created with the same way and data are also often disturbed with noisy, the same distance function may not be optimal for all datasets. This paper aims at improving the discriminative ability and decreasing the uncertainty in the representation from neighborhood rough set to deal with this issue. In this paper, distance learning method is first introduced into neighborhood rough set to optimize the structure of information granules. A novel neighborhood rough set model is then proposed, called Neighborhood rough set Model based on Distance metric learning (NMD). NMD exploits distance metric learning in which samples from the same decision achieve small distance than samples from different decisions. Such a method can improve the consistency of neighborhood granules. The paper also presents the properties of NMD and formulates the importance of feature. In addition, two feature selection algorithms are built upon the proposed NMD. Experimental results on real-world datasets demonstrate the effectiveness of the proposed feature selection algorithms and their superiority against comparison baselines. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 74 条
[31]   Safety monitoring data classification method based on wireless rough network of neighborhood rough sets [J].
Liu, Dan ;
Li, Jingwei .
SAFETY SCIENCE, 2019, 118 :103-108
[32]   Online multi-label streaming feature selection based on neighborhood rough set [J].
Liu, Jinghua ;
Lin, Yaojin ;
Li, Yuwen ;
Weng, Wei ;
Wu, Shunxiang .
PATTERN RECOGNITION, 2018, 84 :273-287
[33]   Impact of class noise on performance of hyperspectral band selection based on neighborhood rough set theory [J].
Liu, Yao ;
Cao, Xiaoda ;
Meng, Xiangli ;
Wu, Tao ;
Yan, Xiaozhen ;
Luo, Qinghua .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 188 :37-45
[34]   Stability analysis of hyperspectral band selection algorithms based on neighborhood rough set theory for classification [J].
Liu, Yao ;
Yang, Junjie ;
Chen, Yuehua ;
Tan, Kezhu ;
Wang, Liguo ;
Yan, Xiaozhen .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 169 :35-44
[35]   Structured optimal graph based sparse feature extraction for semi-supervised learning [J].
Liu, Zhonghua ;
Lai, Zhihui ;
Ou, Weihua ;
Zhang, Kaibing ;
Zheng, Ruijuan .
SIGNAL PROCESSING, 2020, 170
[36]   On modeling similarity and three-way decision under incomplete information in rough set theory [J].
Luo, Junfang ;
Fujita, Hamido ;
Yao, Yiyu ;
Qin, Keyun .
KNOWLEDGE-BASED SYSTEMS, 2020, 191
[37]   A neighborhood rough set model with nominal metric embedding [J].
Luo, Sheng ;
Miao, Duoqian ;
Zhang, Zhifei ;
Zhang, Yuanjian ;
Hu, Shengdan .
INFORMATION SCIENCES, 2020, 520 :373-388
[38]  
Nguyen HS, 1999, LECT NOTES ARTIF INT, V1711, P137
[39]   ROUGH SETS [J].
PAWLAK, Z .
INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05) :341-356
[40]   Rough sets: Some extensions [J].
Pawlak, Zdzislaw ;
Skowron, Andrzej .
INFORMATION SCIENCES, 2007, 177 (01) :28-40