Examining the effect of second-order terms in mathematical programming approaches to the classification problem

被引:4
作者
Wanarat, P [1 ]
Pavur, R [1 ]
机构
[1] UNIV N TEXAS,COLL BUSINESS,BCIS DEPT,DENTON,TX 76203
关键词
mathematical programming; linear programming; mixed integer programming; discriminant analysis;
D O I
10.1016/0377-2217(95)00076-3
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Research on mathematical programming approaches to the classification problem has focused almost exclusively on linear discriminant functions with only first-order terms. While many of these first-order models have displayed excellent classificatory performance when compared to Fisher's linear discriminant method, they cannot compete with Smith's quadratic discriminant method on certain data sets. In this paper, we investigate the appropriateness of including second-order terms in mathematical programming models, Various issues are addressed, such as performance of models with small to moderate sample size, need for crossproduct terms, and loss of power by the mathematical programming models under conditions ideal for the parametric procedures, A simulation study is conducted to assess the relative performance of first-order and second-order mathematical programming models to the parametric procedures. The simulation study indicates that mathematical programming models using polynomial functions may be prone to overfitting on the training samples which in turn may cause rather poor fits on the validation samples. The simulation study also indicates that inclusion of cross-product terms may hurt a polynomial model's accuracy on the validation samples, although omission of them means that the model is not invariant to nonsingular transformations of the data.
引用
收藏
页码:582 / 601
页数:20
相关论文
共 28 条
[1]   AN EFFICIENT OPTIMAL SOLUTION ALGORITHM FOR THE CLASSIFICATION PROBLEM [J].
BANKS, WJ ;
ABAD, PL .
DECISION SCIENCES, 1991, 22 (05) :1008-1023
[2]  
BANKS WJ, 1994, EUR J OPER RES, V74, P23
[3]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[4]   RESOLVING CERTAIN DIFFICULTIES AND IMPROVING THE CLASSIFICATION POWER OF LP DISCRIMINANT-ANALYSIS FORMULATIONS [J].
FREED, N ;
GLOVER, F .
DECISION SCIENCES, 1986, 17 (04) :589-595
[5]  
Freed N., 1981, Decision Sciences, V12, P68, DOI 10.1111/j.1540-5915.1981.tb00061.x
[7]   IMPROVED LINEAR-PROGRAMMING MODELS FOR DISCRIMINANT-ANALYSIS [J].
GLOVER, F .
DECISION SCIENCES, 1990, 21 (04) :771-785
[8]   A NEW CLASS OF MODELS FOR THE DISCRIMINANT PROBLEM [J].
GLOVER, F ;
KEENE, S ;
DUEA, B .
DECISION SCIENCES, 1988, 19 (02) :269-280
[9]  
Hand D.J., 1981, DISCRIMINATION CLASS
[10]   4 APPROACHES TO THE CLASSIFICATION PROBLEM IN DISCRIMINANT-ANALYSIS - AN EXPERIMENTAL-STUDY [J].
JOACHIMSTHALER, EA ;
STAM, A .
DECISION SCIENCES, 1988, 19 (02) :322-333