Mutual Information Estimation with Random Forests

Cited by: 0
Authors
Koeman, Mike [1]
Heskes, Tom [1]
Affiliations
[1] Radboud Univ Nijmegen, Inst Comp & Informat Sci, Nijmegen, Netherlands
Source
NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II | 2014 / Vol. 8835
Keywords
Mutual information; random forests; probabilistic classification trees; feature selection
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a new method for estimating mutual information based on random forest classifiers. The method randomly permutes one of the two variables to create data in which the two variables are independent. We show that mutual information can be estimated from the class probabilities of a probabilistic classifier trained to distinguish the independent data from the dependent data. The method inherits the robustness and flexibility of random forests and, unlike most other approaches for estimating mutual information, can handle mixtures of continuous and discrete data. We tested the method on a variety of data and found it to be accurate on medium and large datasets but inaccurate on smaller ones. On the positive side, the method can estimate the mutual information between sets of both continuous and discrete variables and appears to be relatively insensitive to the addition of noise variables.
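The permute-and-classify idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a Laplace-smoothed count table stands in for the random forest so that the example needs only the Python standard library, and the estimator assumes the standard density-ratio identity for equally sized classes, p(x,y) / (p(x)p(y)) = P(dependent | x,y) / (1 - P(dependent | x,y)), which the abstract does not spell out. The names `estimate_mi` and `make_data` and the toy binary data are hypothetical.

```python
import math
import random
from collections import Counter


def estimate_mi(xs, ys, rng):
    """Estimate I(X;Y) in nats from paired samples (illustrative sketch)."""
    dependent = list(zip(xs, ys))

    # Permuting one variable breaks the pairing, so the permuted pairs
    # behave like draws from p(x)p(y) instead of p(x, y).
    ys_perm = list(ys)
    rng.shuffle(ys_perm)
    independent = list(zip(xs, ys_perm))

    # Stand-in probabilistic classifier (assumption: replaces the random
    # forest): Laplace-smoothed counts of each (x, y) cell per class.
    dep_counts = Counter(dependent)
    ind_counts = Counter(independent)

    def prob_dependent(pair):
        d = dep_counts[pair] + 1
        i = ind_counts[pair] + 1
        return d / (d + i)

    # Average the log odds over the dependent samples; with equal class
    # sizes the odds approximate the ratio p(x, y) / (p(x) p(y)).
    total = 0.0
    for pair in dependent:
        p = prob_dependent(pair)
        total += math.log(p / (1.0 - p))
    return total / len(dependent)


def make_data(n, flip_prob, rng):
    """Binary X, and Y equal to X except flipped with probability flip_prob."""
    xs = [rng.randint(0, 1) for _ in range(n)]
    ys = [x if rng.random() >= flip_prob else 1 - x for x in xs]
    return xs, ys


rng = random.Random(0)

# Strongly dependent pair: Y copies X 90% of the time.
xs, ys = make_data(5000, 0.1, rng)
mi_dependent = estimate_mi(xs, ys, rng)

# Independent pair: a flip probability of 0.5 makes Y unrelated to X.
xs0, ys0 = make_data(5000, 0.5, rng)
mi_independent = estimate_mi(xs0, ys0, rng)

print(f"dependent:   {mi_dependent:.3f} nats")
print(f"independent: {mi_independent:.3f} nats")
```

For this toy setup the true mutual information of the dependent pair is ln 2 minus the binary entropy of 0.1, roughly 0.368 nats, and that of the independent pair is zero, which gives a reference point for the printed estimates. With continuous or mixed variables a count table no longer applies; the class probabilities of a trained probabilistic classifier, such as a random forest, would take its place, preferably evaluated on data not used for fitting.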
Pages: 524-531
Page count: 8