Feature extraction based on direct calculation of mutual information

被引:12
作者
Kwak, Nojun [1 ]
机构
[1] Ajou Univ, Div Elect & Comp Engn, Suwon 443749, South Korea
关键词
feature extraction; mutual information; Parzen window; gradient descent; subspace method; optimization; classification;
D O I
10.1142/S0218001407005892
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many pattern recognition problems, it is desirable to reduce the number of input features by extracting important features related to the problems. By focusing on only the problem-relevant features, the dimension of features can be greatly reduced and thereby can result in a better generalization performance with less computational complexity. In this paper, we propose a feature extraction method for handling classification problems. The proposed algorithm is used to search for a set of linear combinations of the original features, whose mutual information with the output class can be maximized. The mutual information between the extracted features and the output class is calculated by using the probability density estimation based on the Parzen window method. A greedy algorithm using the gradient descent method is used to determine the new features. The computational load is proportional to the square of the number of samples. The proposed method was applied to several classification problems, which showed better or comparable performances than the conventional feature extraction methods.
引用
收藏
页码:1213 / 1231
页数:19
相关论文
共 28 条
[1]  
ADDISON D, 2003, P INT C ART NEUR NET, P212
[2]  
[Anonymous], 9201 J COOK U N QUEE
[3]   Weighted Parzen windows for pattern classification [J].
Babich, GA ;
Camps, OI .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (05) :567-570
[4]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[5]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[6]  
CIOS KJ, 1998, DATA MINING METHODS, pCH9
[7]  
Cover TM, 2006, Elements of Information Theory
[8]  
Devijver P., 1982, PATTERN RECOGN
[9]  
FISHER JW, 1998, P INT JOINT C NEUR N
[10]   THE REDUCED PARZEN CLASSIFIER [J].
FUKUNAGA, K ;
HAYES, RR .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (04) :423-425