Optimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach

Cited by: 0
Author
Marc Boullé
Affiliation
[1] Orange Labs
Source
Advances in Data Analysis and Classification | 2009 / Vol. 3
Keywords
Data preparation; Discretization; Feature selection; Model selection; Supervised classification; 62H17; 62H20; 62H30;
DOI
Not available
Abstract
In the domain of data preparation for supervised classification, filter methods for variable ranking are time-efficient. However, their intrinsic univariate limitation prevents them from detecting redundancies or constructive interactions between variables. This paper introduces a new method to automatically, rapidly and reliably extract the classificatory information of a pair of input variables. It is based on a simultaneous partitioning of the domains of each input variable, into intervals in the numerical case and into groups of categories in the categorical case. The resulting input data grid makes it possible to quantify the joint information between the two input variables and the output variable. The best joint partitioning is sought by maximizing a Bayesian model selection criterion. Intensive experiments demonstrate the benefits of the approach, especially the significant improvement in accuracy for classification tasks.
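To make the idea of a data grid concrete, here is a minimal Python sketch. It is not the paper's method: the abstract does not state the Bayesian criterion, so the score used here is plain conditional entropy of the class given the grid cell, and the cut points are fixed by hand rather than searched. The function names `build_grid` and `conditional_entropy` and the toy data are assumptions made for illustration only.

```python
from bisect import bisect_right
from collections import Counter, defaultdict
from math import log2


def build_grid(xs, ys, labels, x_cuts, y_cuts):
    """Count class labels in each cell of a 2-D grid of intervals.

    x_cuts and y_cuts are sorted cut points (hypothetical inputs);
    an empty list means the variable is left as a single interval.
    """
    grid = defaultdict(Counter)
    for x, y, label in zip(xs, ys, labels):
        cell = (bisect_right(x_cuts, x), bisect_right(y_cuts, y))
        grid[cell][label] += 1
    return grid


def conditional_entropy(grid):
    """H(class | cell) in bits. A stand-in score, NOT the paper's criterion."""
    total = sum(sum(counts.values()) for counts in grid.values())
    h = 0.0
    for counts in grid.values():
        n_cell = sum(counts.values())
        for n in counts.values():
            h -= (n / total) * log2(n / n_cell)
    return h


# Toy XOR-like data: neither variable separates the classes on its own.
xs = [0.1, 0.2, 0.8, 0.9, 0.1, 0.2, 0.8, 0.9]
ys = [0.1, 0.2, 0.1, 0.2, 0.8, 0.9, 0.8, 0.9]
labels = ["a", "a", "b", "b", "b", "b", "a", "a"]

h_x_only = conditional_entropy(build_grid(xs, ys, labels, [0.5], []))
h_y_only = conditional_entropy(build_grid(xs, ys, labels, [], [0.5]))
h_joint = conditional_entropy(build_grid(xs, ys, labels, [0.5], [0.5]))
print(h_x_only, h_y_only, h_joint)
```

On this toy data, partitioning either variable alone leaves the class fully uncertain, while the joint 2 x 2 grid isolates each class in its own cells. This is the kind of constructive interaction that a univariate filter cannot see. A real implementation would also penalize grid complexity, which raw conditional entropy does not.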
Pages: 39-61
Page count: 22