Feature Subset Selection for Ordered Logit Model via Tangent-Plane-Based Approximation

被引:7
作者
Naganuma, Mizuho [1 ]
Takano, Yuichi [2 ]
Miyashiro, Ryuhei [3 ]
机构
[1] Univ Electrocommun, Grad Sch Informat & Engn, Chofu, Tokyo 1828585, Japan
[2] Univ Tsukuba, Fac Engn Informat & Syst, Tsukuba, Ibaraki 3058577, Japan
[3] Tokyo Univ Agr & Technol, Inst Engn, Koganei, Tokyo 1848588, Japan
关键词
optimization; statistics; feature subset selection; ordered logit model; LOGISTIC-REGRESSION; INTEGER; FORMULATIONS;
D O I
10.1587/transinf.2018EDP7188
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is concerned with a mixed-integer optimization (MIO) approach to selecting a subset of relevant features from among many candidates. For ordinal classification, a sequential logit model and an ordered logit model are often employed. For feature subset selection in the sequential logit model, Sato et al. [22] recently proposed a mixed-integer linear optimization (MILO) formulation. In their MILO formulation, a univariate nonlinear function contained in the sequential logit model was represented by a tangent-line-based approximation. We extend this MILO formulation toward the ordered logit model, which is more commonly used for ordinal classification than the sequential logit model is. Making use of tangent planes to approximate a bivariate nonlinear function involved in the ordered logit model, we derive an MILO formulation for feature subset selection in the ordered logit model. Our computational results verify that the proposed method is superior to the L1-regularized ordered logit model in terms of solution quality.
引用
收藏
页码:1046 / 1053
页数:8
相关论文
共 32 条
  • [1] Agresti A., 2010, ANAL ORDINAL CATEGOR, V656
  • [2] AMEMIYA T, 1981, J ECON LIT, V19, P1483
  • [3] Arthanari T.S., 1993, Mathematical Programming in Statistics
  • [4] Beale E.M.L., 1970, P 5 INT C OP RES, P447
  • [5] Logistic Regression: From Art to Science
    Bertsimas, Dimitris
    King, Angela
    [J]. STATISTICAL SCIENCE, 2017, 32 (03) : 367 - 384
  • [6] Optimal classification trees
    Bertsimas, Dimitris
    Dunn, Jack
    [J]. MACHINE LEARNING, 2017, 106 (07) : 1039 - 1082
  • [7] OR Forum-An Algorithmic Approach to Linear Regression
    Bertsimas, Dimitris
    King, Angela
    [J]. OPERATIONS RESEARCH, 2016, 64 (01) : 2 - 16
  • [8] BEST SUBSET SELECTION VIA A MODERN OPTIMIZATION LENS
    Bertsimas, Dimitris
    King, Angela
    Mazumder, Rahul
    [J]. ANNALS OF STATISTICS, 2016, 44 (02) : 813 - 852
  • [9] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
  • [10] Different metaheuristic strategies to solve the feature selection problem
    Casado Yusta, Silvia
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (05) : 525 - 534