Hybrid Correlation and Causal Feature Selection for Ensemble Classifiers

被引:0
作者
Duangsoithong, Rakkrit [1 ]
Windeatt, Terry [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
来源
ENSEMBLES IN MACHINE LEARNING APPLICATIONS | 2011年 / 373卷
关键词
ALGORITHMS; CLASSIFICATION; RELEVANCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
PC and TPDA algorithms are robust and well known prototype algorithms, incorporating constraint-based approaches for causal discovery. However, both algorithms cannot scale up to deal with high dimensional data, that is more than few hundred features. This chapter presents hybrid correlation and causal feature selection for ensemble classifiers to deal with this problem. Redundant features are removed by correlation-based feature selection and then irrelevant features are eliminated by causal feature selection. The number of eliminated features, accuracy, the area under the receiver operating characteristic curve (AUC) and false negative rate (FNR) of proposed algorithms are compared with correlation-based feature selection (FCBF and CFS) and causal based feature selection algorithms (PC, TPDA, GS, IAMB).
引用
收藏
页码:97 / 115
页数:19
相关论文
共 27 条
[1]  
Aliferis C F, 2003, AMIA Annu Symp Proc, P21
[2]  
Almuallim H., 1991, AAAI-91. Proceedings Ninth National Conference on Artificial Intelligence, P547
[3]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[4]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[5]  
Brown LE., 2008, TR0801 DSL
[6]  
Duangsoithong R, 2009, LECT NOTES ARTIF INT, V5632, P206, DOI 10.1007/978-3-642-03070-3_16
[7]  
Friedman N, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P206
[8]  
Guyon I., 2008, Causality Workbench
[9]  
Guyon I, 2008, CH CRC DATA MIN KNOW, P63
[10]  
Hall M. A., 2000, P 17 INT C MACH LEAR, P359, DOI DOI 10.5555/645529.657793