Classification and regression via integer optimization

被引:78
作者
Bertsimas, Dimitris
Shioda, Romy
机构
[1] MIT, Sloan Sch Management & Operat Res Ctr, Cambridge, MA 02139 USA
[2] Univ Waterloo, Fac Math, Dept Combinator & Optimizat, Waterloo, ON N2L 3G1, Canada
关键词
Applications; Programming: integer; Statistics: nonparametric;
D O I
10.1287/opre.1060.0360
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Motivated by the significant advances in integer optimization in the past decade, we introduce mixed-integer optimization methods to the classical statistical problems of classification and regression and construct a software package called CRIO (classification and regression via integer optimization). CRIO separates data points into different polyhedral regions. In classification each region is assigned a class, while in regression each region has its own distinct regression coefficients. Computational experimentations with generated and real data sets show that CRIO is comparable to and often outperforms the current leading methods in classification and regression. We hope that these results illustrate the potential for significant impact of integer optimization methods on computational statistics and data mining.
引用
收藏
页码:252 / 271
页数:20
相关论文
共 23 条
[1]  
[Anonymous], 1999, The Nature Statist. Learn. Theory
[2]  
[Anonymous], 1999, APPL MULTIVARIATE AN
[3]  
[Anonymous], 1999, STOCHASTIC GRADIENT
[4]  
[Anonymous], 1987, ROBUST REGRESSION OU
[5]  
Arthanari T.S., 1993, Mathematical Programming in Statistics
[6]  
BENNETT KP, 1984, 214 DEP MATH SCI REN
[7]  
BERTSIMAS D, 2004, ALGORITHM CARDINALIT
[8]   Computational study of a family of mixed-integer quadratic programming problems [J].
Bienstock, D .
MATHEMATICAL PROGRAMMING, 1996, 74 (02) :121-140
[9]   Stability and generalization [J].
Bousquet, O ;
Elisseeff, A .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :499-526
[10]  
Friedman J, Towards an expansive epistemology: Norms, action and the social sphere