A novel approach to cutting decision trees

Cited by: 3
Author
Uney-Yuksektepe, Fadime [1]
Affiliation
[1] Istanbul Kultur Univ, Dept Ind Engn, TR-34156 Istanbul, Turkey
Keywords
Discriminant analysis; Mathematical programming; Data mining; Decision trees; Piecewise-linear models; DISCRIMINANT-ANALYSIS; PROGRAMMING-MODELS; CLASSIFICATION
DOI
10.1007/s10100-013-0312-9
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Subject classification code
070105; 12; 1201; 1202; 120202
Abstract
In data mining, binary classification has a wide range of applications. Cutting Decision Tree (CDT) induction is an efficient mathematical-programming-based method that tries to discretize the data set at hand by using multiple separating hyperplanes. This study proposes an improvement to the CDT model by incorporating a second goal: maximizing the distance of the correctly classified instances to the misclassification region. Computational results show that the developed model achieves better classification accuracy on the Wisconsin Breast Cancer database and the Japanese Banks data set than existing piecewise-linear models in the literature. Furthermore, remarkable results are obtained on well-known benchmark data sets (Bupa Liver Disorders, Blood Transfusion and Pima Indian Diabetes) when compared to the original CDT model.
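As a rough illustration of the two goals named in the abstract, the sketch below scores a single candidate hyperplane on a binary data set. It is not the paper's mathematical programming model: the function names, the `margin_weight` parameter, and the use of the hyperplane itself as a stand-in for the boundary of the misclassification region are all assumptions made for this example, and the actual CDT method combines multiple hyperplanes.

```python
import math

def score_hyperplane(points, labels, w, b):
    """Score the hyperplane w.x = b on points with labels +1 / -1.

    Returns (total_deviation, total_margin):
      total_deviation: summed distance of misclassified points past the hyperplane
      total_margin:    summed distance of correctly classified points from it
    Hypothetical helper; the hyperplane stands in for the misclassification region.
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    total_deviation = 0.0
    total_margin = 0.0
    for x, y in zip(points, labels):
        # Signed Euclidean distance, positive when the point is on its own side.
        signed = y * (sum(wi * xi for wi, xi in zip(w, x)) - b) / norm
        if signed > 0:
            total_margin += signed
        else:
            total_deviation += -signed
    return total_deviation, total_margin

def two_goal_objective(points, labels, w, b, margin_weight=0.1):
    """Lower is better: penalize deviations, reward margins (assumed weighting)."""
    deviation, margin = score_hyperplane(points, labels, w, b)
    return deviation - margin_weight * margin

# Toy one-dimensional example embedded in 2-D: class -1 on the left, +1 on the right.
points = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0), (4.0, 0.0)]
labels = [-1, -1, 1, 1]
centered = two_goal_objective(points, labels, (1.0, 0.0), 2.0)
shifted = two_goal_objective(points, labels, (1.0, 0.0), 3.5)
```

In this toy case the cut at 2.0 misclassifies nothing, while the cut at 3.5 misclassifies one point, so the combined score prefers the cut at 2.0.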
Pages: 553-565
Number of pages: 13