Mutual information-based method for selecting informative feature sets

被引:54
作者
Herman, Gunawan [1 ,2 ]
Zhang, Bang [1 ,2 ]
Wang, Yang [1 ,2 ]
Ye, Getian [3 ]
Chen, Fang [1 ,2 ]
机构
[1] Natl ICT Australia, Eveleigh, NSW 2015, Australia
[2] Univ New S Wales, Sydney, NSW 2052, Australia
[3] Canon Informat Syst Res Australia, N Ryde, NSW 2113, Australia
基金
澳大利亚研究理事会;
关键词
Feature selection; Mutual information; DEPENDENCY; EXTRACTION; PATTERNS;
D O I
10.1016/j.patcog.2013.04.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the fundamental problems in pattern recognition and data mining. A popular and effective approach to feature selection is based on information theory, namely the mutual information of features and class variable. In this paper we compare eight different mutual information-based feature selection methods. Based on the analysis of the comparison results, we propose a new mutual information-based feature selection method. By taking into account both the class-dependent and class-independent correlation among features, the proposed method selects a less redundant and more informative set of features. The advantage of the proposed method over other methods is demonstrated by the results of experiments on UCI datasets (Asuncion and Newman, 2010 [1]) and object recognition. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3315 / 3327
页数:13
相关论文
共 26 条
[1]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[2]   Mutual information-based selection of optimal spatial-temporal patterns for single-trial EEG-based BCIs [J].
Ang, Kai Keng ;
Chin, Zheng Yang ;
Zhang, Haihong ;
Guan, Cuntai .
PATTERN RECOGNITION, 2012, 45 (06) :2137-2144
[3]  
[Anonymous], FDN TRENDS COMPUTER
[4]  
[Anonymous], 2009, A practical guide to support vector classification
[5]  
[Anonymous], 1991, ELEMENTS INFORM THEO, DOI [DOI 10.1002/0471200611, 10.1002/0471200611]
[6]  
[Anonymous], 2010, UCI machine learning repository
[7]   On the Feature Selection Criterion Based on an Approximation of Multidimensional Mutual Information [J].
Balagani, Kiran S. ;
Phoha, Vir V. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (07) :1342-1343
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]   MILES: Multiple-Instance Learning via Embedded instance Selection [J].
Chen, Yixin ;
Bi, Jinbo ;
Wang, James Z. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :1931-1947
[10]   CLASS-DEPENDENT DISCRETIZATION FOR INDUCTIVE LEARNING FROM CONTINUOUS AND MIXED-MODE DATA [J].
CHING, JY ;
WONG, AKC ;
CHAN, KCC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (07) :641-651