A filter approach to feature selection based on mutual information

被引:0
作者
Huang, Jinjie [1 ]
Cai, Yunze [1 ]
Xu, Xiaoming [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Dongchuan Rd 800, Shanghai 200240, Peoples R China
来源
PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2 | 2006年
基金
中国博士后科学基金; 黑龙江省自然科学基金; 中国国家自然科学基金;
关键词
pattern classification; machine learning; feature selection; filter approach; mutual information;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In pattern recognition, feature selection aims to choose the smallest subset of features that is necessary and sufficient to describe the target concept. In this paper, a mutual information-based constructive criterion under arbitrary information distributions of input features is presented for feature selection. This criterion can capture both the relevance to the output classes and the redundancy with respect to the already,-selected features without any parameters like 8 in MIFS or MIFS-U methods to be preset. Furthermore, a modified greedy feature selection algorithm called MICC is proposed, and experimental results demonstrate the good performance of MICC on both synthetic and benchmark data sets.
引用
收藏
页码:84 / 89
页数:6
相关论文
共 20 条
[11]   Wrappers for feature subset selection [J].
Kohavi, R ;
John, GH .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :273-324
[12]   Input feature selection for classification problems [J].
Kwak, N ;
Choi, CH .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01) :143-159
[13]   A compact and accurate model for classification [J].
Last, M ;
Maimon, O .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (02) :203-215
[14]   Feature selection based on similarity [J].
Lazzerini, B ;
Marcelloni, F .
ELECTRONICS LETTERS, 2002, 38 (03) :121-122
[15]  
Liu H., 1998, Feature Extraction, Construction and Selection: A Data Mining Perspective
[16]  
MERZ CJ, 1996, REPOSITORY MACHINE L
[17]   Improved use of continuous attributes in C4.5 [J].
Quinlan, JR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :77-90
[18]  
Shannon Claude, 1998, The Mathematical Theory of Communication
[19]  
VAPNIK V, 1998, STAT LEARING THEORY
[20]   Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition [J].
Wang, XC ;
Paliwal, KK .
PATTERN RECOGNITION, 2003, 36 (10) :2429-2439