Multivariate decision trees using linear discriminants and tabu search

被引:35
作者
Li, XB [1 ]
Sweigart, JR
Teng, JTC
Donohue, JM
Thombs, LA
Wang, SM
机构
[1] Univ Texas, Sch Management, Richardson, TX 75083 USA
[2] Univ S Carolina, Dept Management Sci, Columbia, SC 29208 USA
[3] Univ Texas, Dept Informat Syst & Operat Management, Coll Business Adm, Arlington, TX 76019 USA
[4] Univ S Carolina, Dept Stat, Columbia, SC 29208 USA
[5] AT&T Labs, Pleasanton, CA 94588 USA
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 2003年 / 33卷 / 02期
关键词
classification data mining; decision trees; linear discriminant; tabu search;
D O I
10.1109/TSMCA.2002.806499
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A new decision tree method for application in data mining, machine learning, pattern recognition, and other areas is proposed in this paper. The new method incorporates a classical multivariate statistical method, linear. discriminant function, into decision trees' recursive partitioning process. The proposed method considers not only the linear combination with all variables, but also combinations with fewer variables. It uses a tabu search technique to find appropriate variable combinations within a reasonable length of time. For problems with more than two classes, the tabu search technique is also used to group the data into two superclasses before each split. The results of our experimental study indicate that the proposed algorithm appears to outperform some of the major classification algorithms in terms of classification accuracy, the proposed algorithm generates decision trees with relatively small sizes, and the proposed algorithm runs faster than most multivariate decision trees and its computing time increases linearly with data size, indicating that the algorithm is scalable to large datasets.
引用
收藏
页码:194 / 205
页数:12
相关论文
共 51 条
[1]  
Alimoglu F., 1996, P 5 INT ART NEUR NET
[2]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[3]  
Bennett K.P., 1992, Technical report
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
BRODLEY CE, 1995, MACH LEARN, V19, P45, DOI 10.1007/BF00994660
[8]  
BUNTINE W, 1992, INTRO IND VERSION 2
[9]   Another move toward the minimum consistent subset:: A tabu search approach to the condensed nearest neighbor rule [J].
Cerverón, V ;
Ferri, FJ .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (03) :408-413
[10]  
Clark L.A., 1992, STAT MODELS S, P377