Fuzzy Mutual Information Feature Selection Based on Representative Samples

被引:7
作者
Salem, Omar A. M. [1 ]
Wang, Liwei [2 ]
机构
[1] Suez Canal Univ, Fac Comp Sci & Informat, Ismailia, Egypt
[2] Wuhan Univ, Int Sch Software, Wuhan, Hubei, Peoples R China
关键词
Feature Selection; Fuzzy Mutual Information; Fuzzy Sets; Mutual Information;
D O I
10.4018/IJSI.2018010105
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Building classification models from real-world datasets became a difficult task, especially in datasets with high dimensional features. Unfortunately, these datasets may include irrelevant or redundant features which have a negative effect on the classification performance. Selecting the significant features and eliminating undesirable features can improve the classification models. Fuzzy mutual information is widely used feature selection to find the best feature subset before classification process. However, it requires more computation and storage space. To overcome these limitations, this paper proposes an improved fuzzy mutual information feature selection based on representative samples. Based on benchmark datasets, the experiments show that the proposed method achieved better results in the terms of classification accuracy, selected feature subset size, storage, and stability.
引用
收藏
页码:58 / 72
页数:15
相关论文
共 27 条
[11]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[12]   Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation [J].
Hu, Qinghua ;
Xie, Zongxia ;
Yu, Daren .
PATTERN RECOGNITION, 2007, 40 (12) :3509-3521
[13]  
Janecek A, 2008, JMLR P WORKSH C ANTW, P90
[14]  
Kohavi R, 1995, P 14 INT JOINT C ART, P1137, DOI DOI 10.1067/MOD.2000.109031
[15]  
Kuncheva L.I., 2007, ARTIFICIAL INTELLIGE, P421
[16]   A novel feature selection method based on normalized mutual information [J].
La The Vinh ;
Lee, Sungyoung ;
Park, Young-Tack ;
d'Auriol, Brian J. .
APPLIED INTELLIGENCE, 2012, 37 (01) :100-120
[17]   A new method for measuring uncertainty and fuzziness in rough set theory [J].
Liang, JY ;
Chin, KS ;
Dang, CY ;
Yam, RCM .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2002, 31 (04) :331-342
[18]  
Nogueira S., 2016, P JOINT EUROPEAN C M, P442, DOI [10.1007/978-3-319-46227-1_28, DOI 10.1007/978-3-319-46227-1_28]
[19]   Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy [J].
Peng, HC ;
Long, FH ;
Ding, C .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) :1226-1238
[20]   A review of feature selection techniques in bioinformatics [J].
Saeys, Yvan ;
Inza, Inaki ;
Larranaga, Pedro .
BIOINFORMATICS, 2007, 23 (19) :2507-2517